2025-05-07T19:42:38.8473381Z Current runner version: '2.323.0' 2025-05-07T19:42:38.8479711Z Runner name: 'i-07585e80669af62a2' 2025-05-07T19:42:38.8480655Z Machine name: 'ip-10-0-13-233' 2025-05-07T19:42:38.8483412Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:38.8485669Z Contents: read 2025-05-07T19:42:38.8486262Z Metadata: read 2025-05-07T19:42:38.8486760Z Packages: read 2025-05-07T19:42:38.8487404Z ##[endgroup] 2025-05-07T19:42:38.8489818Z Secret source: None 2025-05-07T19:42:38.8490955Z Prepare workflow directory 2025-05-07T19:42:38.9091664Z Prepare all required actions 2025-05-07T19:42:38.9128247Z Getting action download info 2025-05-07T19:42:39.1013957Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:39.3657291Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:39.8982101Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.9, 12.6.3, clang) 2025-05-07T19:42:39.9868867Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:39.9999722Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:40.0010316Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:40.0011731Z ##[endgroup] 2025-05-07T19:42:41.1281395Z Runner Type: linux.24xlarge 2025-05-07T19:42:41.1281915Z Instance Type: c5.24xlarge 2025-05-07T19:42:41.1282231Z AMI Name: unknown 2025-05-07T19:42:41.1323163Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:46.2088652Z ##[group]Checking docker version 2025-05-07T19:42:46.2101432Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:46.2306950Z '1.44' 2025-05-07T19:42:46.2325264Z Docker daemon API version: '1.44' 2025-05-07T19:42:46.2325791Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:46.2516104Z '1.44' 2025-05-07T19:42:46.2527788Z Docker client 
API version: '1.44' 2025-05-07T19:42:46.2532281Z ##[endgroup] 2025-05-07T19:42:46.2535160Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:46.2539783Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=48f405" 2025-05-07T19:42:46.2695683Z ##[command]/usr/bin/docker network prune --force --filter "label=48f405" 2025-05-07T19:42:46.2839470Z ##[endgroup] 2025-05-07T19:42:46.2839845Z ##[group]Create local container network 2025-05-07T19:42:46.2848863Z ##[command]/usr/bin/docker network create --label 48f405 github_network_80f6ca3f88ec44e288b33a2daa062f9f 2025-05-07T19:42:46.5903962Z 287819e8c64fd98380a7a79f96fa4e762c769ac46452428ccd5da4052f4dea56 2025-05-07T19:42:46.5924418Z ##[endgroup] 2025-05-07T19:42:46.5955334Z ##[group]Starting job container 2025-05-07T19:42:46.5977333Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:46.7330356Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:46.7391301Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:46.7392090Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:46.7403874Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:46.7488973Z ##[command]/usr/bin/docker create --name 8517788a26554f83a076d858efab411e_amazonlinux2023_2de706 --label 48f405 --workdir /__w/FBGEMM/FBGEMM --network github_network_80f6ca3f88ec44e288b33a2daa062f9f --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" 
amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:46.8318562Z bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 2025-05-07T19:42:46.8344453Z ##[command]/usr/bin/docker start bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 2025-05-07T19:42:47.3782291Z bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 2025-05-07T19:42:47.3807655Z ##[command]/usr/bin/docker ps --all --filter id=bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:47.3970030Z bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 Up Less than a second 2025-05-07T19:42:47.3992404Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 2025-05-07T19:42:47.4147441Z HOME=/github/home 2025-05-07T19:42:47.4147984Z GITHUB_ACTIONS=true 2025-05-07T19:42:47.4148353Z CI=true 2025-05-07T19:42:47.4148791Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:47.4167867Z ##[endgroup] 2025-05-07T19:42:47.4179495Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:47.4181826Z ##[endgroup] 2025-05-07T19:42:47.4268009Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.4269017Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:47.4270206Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:47.4270647Z env: 2025-05-07T19:42:47.4270965Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:47.4271402Z BUILD_ENV: build_binary 2025-05-07T19:42:47.4271719Z BUILD_TARGET: default 2025-05-07T19:42:47.4272155Z BUILD_VARIANT: cuda 2025-05-07T19:42:47.4272519Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:42:47.4272943Z ##[endgroup] 2025-05-07T19:42:48.1298891Z Amazon Linux 2023 repository 103 MB/s | 37 MB 00:00 
2025-05-07T19:42:54.7527522Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:42:55.3124547Z Dependencies resolved. 2025-05-07T19:42:55.3300936Z Nothing to do. 2025-05-07T19:42:55.3302548Z Complete! 2025-05-07T19:42:55.5740354Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:42:55.6377616Z Dependencies resolved. 2025-05-07T19:42:55.6603926Z ======================================================================================== 2025-05-07T19:42:55.6605366Z Package Arch Version Repository Size 2025-05-07T19:42:55.6606092Z ======================================================================================== 2025-05-07T19:42:55.6606747Z Installing: 2025-05-07T19:42:55.6607209Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:55.6607814Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:55.6608438Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:55.6609103Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:55.6609681Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:55.6610274Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:55.6610805Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:55.6611397Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6611933Z Installing dependencies: 2025-05-07T19:42:55.6612419Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:55.6613062Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:55.6613764Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6614450Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:55.6615275Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:55.6615959Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 
2025-05-07T19:42:55.6616575Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:55.6617104Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:55.6617753Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:55.6618448Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:55.6619023Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:55.6619741Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:55.6620695Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:55.6621281Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:55.6621898Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:55.6622506Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.6623091Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:55.6623706Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:55.6624326Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6625018Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:55.6625698Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:55.6626345Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.6626906Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:55.6730860Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:55.6731601Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:55.6732163Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:55.6732722Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:55.6733313Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:55.6733957Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:55.6734603Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 
amazonlinux 41 k 2025-05-07T19:42:55.6735208Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.6735795Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:55.6736355Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:55.6736936Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:55.6737609Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.6738194Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:55.6738794Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6739379Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.6740247Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:55.6740841Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6741411Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6742036Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6742610Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:55.6743233Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:55.6743881Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:55.6744467Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6745070Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.6745785Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.6746388Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.6746985Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:55.6747589Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:55.6748146Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 
2025-05-07T19:42:55.6748724Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:55.6749312Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:55.6749889Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.6750483Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:55.6751059Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:55.6751623Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:55.6752178Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:55.6752775Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:55.6753385Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:55.6753979Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:55.6754578Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:55.6755183Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:55.6755810Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.6756390Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:55.6756935Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:55.6757598Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6758169Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:55.6758762Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:55.6759333Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:55.6759923Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:55.6760555Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:55.6761209Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 
amazonlinux 34 k 2025-05-07T19:42:55.6761764Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:55.6762285Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:55.6762945Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:55.6763472Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:55.6763986Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:55.6764497Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:55.6764978Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:55.6765475Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:55.6766010Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:55.6766531Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:55.6767077Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.6767620Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:55.6768135Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:55.6768662Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:55.6769149Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:55.6769652Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:55.6770166Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:55.6770652Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:55.6771162Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:55.6771566Z Installing weak dependencies: 2025-05-07T19:42:55.6771996Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:55.6772547Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:55.6773103Z perl-IO-Socket-SSL noarch 
2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:55.6773657Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:55.6774171Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:55.6774705Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:55.6775033Z 2025-05-07T19:42:55.6775125Z Transaction Summary 2025-05-07T19:42:55.6775393Z ======================================================================================== 2025-05-07T19:42:55.6775709Z Install 107 Packages 2025-05-07T19:42:55.6775846Z 2025-05-07T19:42:55.6776292Z Total download size: 38 M 2025-05-07T19:42:55.6776557Z Installed size: 151 M 2025-05-07T19:42:55.6776964Z Downloading Packages: 2025-05-07T19:42:55.9607265Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.7 MB/s | 82 kB 00:00 2025-05-07T19:42:55.9693332Z (2/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 25 MB/s | 786 kB 00:00 2025-05-07T19:42:55.9963829Z (3/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 90 MB/s | 5.3 MB 00:00 2025-05-07T19:42:55.9988326Z (4/107): elfutils-debuginfod-client-0.188-3.amz 1.1 MB/s | 41 kB 00:00 2025-05-07T19:42:56.0028528Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 
17 MB/s | 539 kB 00:00 2025-05-07T19:42:56.0054524Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 6.8 MB/s | 54 kB 00:00 2025-05-07T19:42:56.0214270Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 69 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.0406167Z (8/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 75 MB/s | 2.8 MB 00:00 2025-05-07T19:42:56.0485747Z (9/107): groff-base-1.22.4-7.amzn2023.0.2.x86_6 51 MB/s | 1.0 MB 00:00 2025-05-07T19:42:56.0702042Z (10/107): git-core-2.47.1-1.amzn2023.0.2.x86_64 71 MB/s | 4.7 MB 00:00 2025-05-07T19:42:56.0725584Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 5.3 MB/s | 160 kB 00:00 2025-05-07T19:42:56.0883206Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 42 MB/s | 1.6 MB 00:00 2025-05-07T19:42:56.0896239Z (13/107): jansson-2.14-0.amzn2023.x86_64.rpm 3.1 MB/s | 46 kB 00:00 2025-05-07T19:42:56.0918298Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 3.7 MB/s | 62 kB 00:00 2025-05-07T19:42:56.0967364Z (15/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 8.9 MB/s | 57 kB 00:00 2025-05-07T19:42:56.0995805Z (16/107): less-608-2.amzn2023.0.2.x86_64.rpm 18 MB/s | 168 kB 00:00 2025-05-07T19:42:56.1062046Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 53 MB/s | 756 kB 00:00 2025-05-07T19:42:56.1080177Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 
2.5 MB/s | 28 kB 00:00 2025-05-07T19:42:56.1102101Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 11 MB/s | 108 kB 00:00 2025-05-07T19:42:56.1133942Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 22 MB/s | 153 kB 00:00 2025-05-07T19:42:56.1175516Z (21/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 4.9 MB/s | 31 kB 00:00 2025-05-07T19:42:56.1205539Z (22/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 15 MB/s | 106 kB 00:00 2025-05-07T19:42:56.1224685Z (23/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 8.0 MB/s | 95 kB 00:00 2025-05-07T19:42:56.1250505Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 17 MB/s | 121 kB 00:00 2025-05-07T19:42:56.1262464Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 4.7 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1300370Z (26/107): nano-default-editor-8.3-1.amzn2023.no 2.1 MB/s | 10 kB 00:00 2025-05-07T19:42:56.1344924Z (27/107): nano-8.3-1.amzn2023.x86_64.rpm 60 MB/s | 706 kB 00:00 2025-05-07T19:42:56.1388176Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 31 MB/s | 394 kB 00:00 2025-05-07T19:42:56.1439025Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 43 MB/s | 573 kB 00:00 2025-05-07T19:42:56.1484154Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 20 MB/s | 256 kB 00:00 2025-05-07T19:42:56.1525770Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 35 MB/s | 454 kB 00:00 2025-05-07T19:42:56.1578668Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 55 MB/s | 708 kB 00:00 2025-05-07T19:42:56.1631085Z (33/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 39 MB/s | 542 kB 00:00 2025-05-07T19:42:56.1651289Z (34/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 
7.9 MB/s | 93 kB 00:00 2025-05-07T19:42:56.1669671Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 5.2 MB/s | 41 kB 00:00 2025-05-07T19:42:56.1723863Z (36/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 27 MB/s | 179 kB 00:00 2025-05-07T19:42:56.1743308Z (37/107): perl-AutoLoader-5.74-477.amzn2023.0.6 2.5 MB/s | 22 kB 00:00 2025-05-07T19:42:56.1755361Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 3.6 MB/s | 29 kB 00:00 2025-05-07T19:42:56.1774421Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 4.9 MB/s | 22 kB 00:00 2025-05-07T19:42:56.1808428Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 12 MB/s | 55 kB 00:00 2025-05-07T19:42:56.1844953Z (41/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.3 MB/s | 36 kB 00:00 2025-05-07T19:42:56.1863713Z (42/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 2.5 MB/s | 26 kB 00:00 2025-05-07T19:42:56.1876390Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.0 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2025649Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 94 MB/s | 1.7 MB 00:00 2025-05-07T19:42:56.2044900Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 949 kB/s | 15 kB 00:00 2025-05-07T19:42:56.2055494Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.3 MB/s | 41 kB 00:00 2025-05-07T19:42:56.2082000Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 5.9 MB/s | 31 kB 00:00 2025-05-07T19:42:56.2104678Z (48/107): perl-File-Basename-2.85-477.amzn2023. 3.8 MB/s | 18 kB 00:00 2025-05-07T19:42:56.2123753Z (49/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 3.3 MB/s | 21 kB 00:00 2025-05-07T19:42:56.2142439Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 4.6 MB/s | 26 kB 00:00 2025-05-07T19:42:56.2160442Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 7.2 MB/s | 36 kB 00:00 2025-05-07T19:42:56.2183768Z (52/107): perl-File-Temp-0.231.100-2.amzn2023.0 10 MB/s | 60 kB 00:00 2025-05-07T19:42:56.2204868Z (53/107): perl-File-stat-1.09-477.amzn2023.0.6. 
3.1 MB/s | 17 kB 00:00 2025-05-07T19:42:56.2222291Z (54/107): perl-FileHandle-2.03-477.amzn2023.0.6 2.9 MB/s | 16 kB 00:00 2025-05-07T19:42:56.2246609Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 11 MB/s | 60 kB 00:00 2025-05-07T19:42:56.2263083Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 3.0 MB/s | 16 kB 00:00 2025-05-07T19:42:56.2304372Z (57/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 10 MB/s | 56 kB 00:00 2025-05-07T19:42:56.2327942Z (58/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 4.2 MB/s | 42 kB 00:00 2025-05-07T19:42:56.2344606Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 11 MB/s | 87 kB 00:00 2025-05-07T19:42:56.2366822Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 7.5 MB/s | 42 kB 00:00 2025-05-07T19:42:56.2421684Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 31 MB/s | 218 kB 00:00 2025-05-07T19:42:56.2446178Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 2.5 MB/s | 23 kB 00:00 2025-05-07T19:42:56.2459601Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 3.3 MB/s | 31 kB 00:00 2025-05-07T19:42:56.2477419Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.7 MB/s | 13 kB 00:00 2025-05-07T19:42:56.2540485Z (65/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 51 MB/s | 392 kB 00:00 2025-05-07T19:42:56.2558647Z (66/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 2.3 MB/s | 23 kB 00:00 2025-05-07T19:42:56.2577493Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 9.9 MB/s | 97 kB 00:00 2025-05-07T19:42:56.2600165Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 15 MB/s | 85 kB 00:00 2025-05-07T19:42:56.2639687Z (69/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 15 MB/s | 84 kB 00:00 2025-05-07T19:42:56.2663357Z (70/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 2.4 MB/s | 20 kB 00:00 2025-05-07T19:42:56.2691490Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 24 MB/s | 215 kB 00:00 2025-05-07T19:42:56.2752554Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 3.9 MB/s | 41 kB 00:00 2025-05-07T19:42:56.2764050Z (73/107): perl-SelectSaver-1.02-477.amzn2023.0. 
1.8 MB/s | 12 kB 00:00 2025-05-07T19:42:56.2788356Z (74/107): perl-Scalar-List-Utils-1.56-459.amzn2 7.6 MB/s | 71 kB 00:00 2025-05-07T19:42:56.2826408Z (75/107): perl-Storable-3.21-458.amzn2023.0.2.x 17 MB/s | 96 kB 00:00 2025-05-07T19:42:56.2847139Z (76/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 7.2 MB/s | 55 kB 00:00 2025-05-07T19:42:56.2863873Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.0 MB/s | 15 kB 00:00 2025-05-07T19:42:56.2884099Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 8.6 MB/s | 48 kB 00:00 2025-05-07T19:42:56.2901514Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 4.6 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2923633Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 6.7 MB/s | 36 kB 00:00 2025-05-07T19:42:56.2940426Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 3.3 MB/s | 17 kB 00:00 2025-05-07T19:42:56.2961350Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 4.2 MB/s | 22 kB 00:00 2025-05-07T19:42:56.2981394Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 6.1 MB/s | 34 kB 00:00 2025-05-07T19:42:56.3012777Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 16 MB/s | 108 kB 00:00 2025-05-07T19:42:56.3031222Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 2.5 MB/s | 17 kB 00:00 2025-05-07T19:42:56.3054080Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 3.6 MB/s | 23 kB 00:00 2025-05-07T19:42:56.3069381Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.7 MB/s | 14 kB 00:00 2025-05-07T19:42:56.3104946Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 
10 MB/s | 71 kB 00:00 2025-05-07T19:42:56.3119156Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 2.5 MB/s | 15 kB 00:00 2025-05-07T19:42:56.3142857Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 18 MB/s | 126 kB 00:00 2025-05-07T19:42:56.3223273Z (91/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 2.9 MB/s | 29 kB 00:00 2025-05-07T19:42:56.3322337Z (92/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 104 MB/s | 2.0 MB 00:00 2025-05-07T19:42:56.3341904Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.3 MB/s | 46 kB 00:00 2025-05-07T19:42:56.3348935Z (94/107): perl-overloading-0.02-477.amzn2023.0. 1.0 MB/s | 13 kB 00:00 2025-05-07T19:42:56.3382552Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 2.4 MB/s | 14 kB 00:00 2025-05-07T19:42:56.3407124Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 21 MB/s | 112 kB 00:00 2025-05-07T19:42:56.3417487Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.8 MB/s | 12 kB 00:00 2025-05-07T19:42:56.3433879Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.9 MB/s | 13 kB 00:00 2025-05-07T19:42:56.3550959Z (99/107): shadow-utils-4.9-12.amzn2023.0.4.x86_ 78 MB/s | 1.1 MB 00:00 2025-05-07T19:42:56.3639796Z (100/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 58 MB/s | 1.3 MB 00:00 2025-05-07T19:42:56.3656339Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 2.5 MB/s | 56 kB 00:00 2025-05-07T19:42:56.3713600Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 44 MB/s | 613 kB 00:00 2025-05-07T19:42:56.3784807Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 73 MB/s | 879 kB 00:00 2025-05-07T19:42:56.3952902Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 77 MB/s | 2.2 MB 00:00 2025-05-07T19:42:56.3984454Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 
16 MB/s | 432 kB 00:00 2025-05-07T19:42:56.4046181Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 33 MB/s | 779 kB 00:00 2025-05-07T19:42:56.4063604Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 6.4 MB/s | 42 kB 00:00 2025-05-07T19:42:56.4080594Z -------------------------------------------------------------------------------- 2025-05-07T19:42:56.4081343Z Total 51 MB/s | 38 MB 00:00 2025-05-07T19:42:57.4779473Z Running transaction check 2025-05-07T19:42:57.5238149Z Transaction check succeeded. 2025-05-07T19:42:57.5238478Z Running transaction test 2025-05-07T19:42:57.8920629Z Transaction test succeeded. 2025-05-07T19:42:57.8921104Z Running transaction 2025-05-07T19:42:58.9286789Z Preparing : 1/1 2025-05-07T19:42:58.9448992Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:58.9702911Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:58.9924377Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:59.0006045Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:59.0073920Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:59.0180144Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:59.0478357Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:59.0566645Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:59.0627614Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:59.1148300Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:59.1243687Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:59.1702414Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:59.1774714Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:59.1846538Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:59.1909749Z Installing : 
libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:59.1972839Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:59.2123194Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:59.2187477Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:59.2260056Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:59.2338845Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:59.2407706Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:59.2463909Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:59.2911050Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:59.3009859Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:59.3170656Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:59.3622041Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:59.3815729Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:59.4650234Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:59.4650848Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.4651412Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:59.4651688Z 2025-05-07T19:42:59.4859948Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:59.5210389Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.5407697Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:59.5478069Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.6605919Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:59.8132973Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:59.8263814Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 
2025-05-07T19:42:59.8694252Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.8778284Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.8852253Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:59.8930507Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:59.9022715Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:59.9078356Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:59.9124218Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:59.9177443Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:59.9269272Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:59.9342865Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:59.9443526Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:59.9665480Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:59.9750973Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:59.9803295Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:59.9847611Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:59.9905167Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:59.9972831Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:43:00.0031233Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:43:00.0128007Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:43:00.0201707Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:43:00.0248437Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:43:00.0308466Z Installing : 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:43:00.0368710Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:43:00.0429585Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:43:00.0475849Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:43:00.0535320Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:43:00.0603289Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:43:00.0660877Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:43:00.0772707Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:43:00.0861142Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:43:00.0918970Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:43:00.0967486Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:43:00.1009480Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:43:00.1082869Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:43:00.1182992Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:43:00.1261678Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:43:00.1319432Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:43:00.1379390Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:43:00.1460770Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:43:00.1527167Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:43:00.1587778Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:43:00.1664489Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:43:00.1718809Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 
2025-05-07T19:43:00.1771448Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:43:00.1835621Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:43:00.1917784Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:43:00.1992102Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:43:00.2062369Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:43:00.2129736Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:43:00.2179549Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:43:00.2236901Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:43:00.2303375Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:43:00.2354035Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:43:00.2414049Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:43:00.2470090Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:43:00.2528838Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:43:00.2610780Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:43:00.3144749Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:43:00.4121391Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:43:00.4252030Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:00.4334649Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:43:00.4401600Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:43:00.4467162Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:43:00.4536427Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:43:00.4591975Z Installing : 
perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:43:00.4657033Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:00.4734576Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:43:00.4933849Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:43:00.5062275Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:43:00.5144455Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:43:00.5546731Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:43:00.6774671Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:43:00.6867905Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.6979563Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:43:00.7281460Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:43:00.7378873Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:00.7624499Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:43:00.7831640Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:00.7917943Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:00.8029348Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:43:01.5705713Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.5707829Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:43:01.5709502Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:43:01.5711707Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:43:01.5713742Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 
4/107 2025-05-07T19:43:01.5715569Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:43:01.5716412Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:43:01.5716967Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:43:01.5717600Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:43:01.5718559Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:43:01.5719140Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:43:01.5719775Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:43:01.5720367Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:43:01.5721003Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:43:01.5721664Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:43:01.5722388Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:43:01.5723015Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:43:01.5723556Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:43:01.5724243Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:43:01.5724876Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:43:01.5725428Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:43:01.5726109Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:43:01.5726688Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:43:01.5727390Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:43:01.5728057Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:43:01.5728642Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:43:01.5729276Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:43:01.5730012Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:43:01.5730678Z Verifying : 
ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:43:01.5731355Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:43:01.5731949Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:43:01.5732568Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:43:01.5733151Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:43:01.5733860Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:43:01.5734423Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:43:01.5735048Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:43:01.5735740Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:43:01.5736472Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:43:01.5737062Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:43:01.5737815Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:43:01.5738479Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:43:01.5739099Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:43:01.5739752Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:43:01.5740786Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:43:01.5741430Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:43:01.5742138Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:43:01.5742883Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:43:01.5743505Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:43:01.5744079Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:43:01.5744697Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:43:01.5745347Z Verifying : 
perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:43:01.5745918Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:43:01.5746512Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:43:01.5747184Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:43:01.5747776Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:43:01.5748369Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:43:01.5748934Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:43:01.5749515Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:43:01.5750064Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:43:01.5750787Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:43:01.5751305Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:43:01.5751875Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:43:01.5752441Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:43:01.5752963Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:43:01.5753514Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:43:01.5754031Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:43:01.5754572Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:43:01.5755081Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:43:01.5755616Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:43:01.5756170Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:43:01.5756695Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:43:01.5757239Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 
2025-05-07T19:43:01.5757743Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:43:01.5758277Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:43:01.5758915Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:43:01.5759430Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:43:01.5759955Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:43:01.5760462Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:43:01.5761017Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:43:01.5761540Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:43:01.5762093Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:43:01.5762654Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:43:01.5763190Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:43:01.5763737Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:43:01.5764309Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:43:01.5764853Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:43:01.5765369Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:43:01.5765911Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:43:01.5766452Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:43:01.5766962Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:43:01.5767487Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:43:01.5767988Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:43:01.5768502Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:43:01.5769043Z Verifying : 
perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:43:01.5769577Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:43:01.5770120Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:43:01.5770629Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:43:01.5771169Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:43:01.5771673Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:43:01.5772211Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:43:01.5772733Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:43:01.5773242Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:43:01.5773799Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:43:01.5774290Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:43:01.5774801Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:43:01.5775339Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:43:01.5775836Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:43:01.6816251Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:43:01.6817382Z 2025-05-07T19:43:01.6817630Z Installed: 2025-05-07T19:43:01.6818594Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:43:01.6820396Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6822004Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:43:01.6824248Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6825936Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6827267Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6827768Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6828273Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.6828805Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 
2025-05-07T19:43:01.6829322Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6829809Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.6830324Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.6830927Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:43:01.6831457Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:43:01.6831983Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6832471Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6832998Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6833498Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:43:01.6834041Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6834556Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.6835080Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6835619Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6836151Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6836698Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6837220Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6837736Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:43:01.6838229Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:43:01.6838772Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6839278Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.6839758Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:43:01.6840266Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.6840780Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:43:01.6841292Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:43:01.6841784Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6842309Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6842894Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6843526Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6844086Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.6844625Z 
perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6845198Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6845847Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.6846374Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6846926Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6847432Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6847944Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6848463Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.6848976Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.6849509Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6850035Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6850688Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6851220Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.6851767Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.6852325Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6852866Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6853442Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.6853984Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6854538Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.6855066Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:43:01.6855618Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6856172Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:43:01.6856720Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.6857287Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6857812Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6858377Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:43:01.6858920Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6859427Z 
perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:43:01.6859960Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6860805Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6861414Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.6861987Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:43:01.6862574Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.6863139Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.6863691Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6864284Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6864843Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6865400Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6865969Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6866867Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.6867424Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.6867947Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6868510Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.6869066Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:43:01.6869620Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:43:01.6870140Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:43:01.6870636Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6871166Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:43:01.6871673Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6872387Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6872887Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6873402Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:43:01.6873914Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6874394Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:43:01.6874915Z 
perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6875445Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6876359Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.6876929Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:43:01.6877486Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6878049Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:43:01.6878586Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:43:01.6879129Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.6879672Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:43:01.6880255Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:43:01.6880785Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.6881289Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.6881843Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.6882501Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:43:01.6882971Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:43:01.6883255Z 2025-05-07T19:43:01.6883356Z Complete! 
2025-05-07T19:43:01.7558204Z ##[group]Run actions/checkout@v4 2025-05-07T19:43:01.7558564Z with: 2025-05-07T19:43:01.7558759Z submodules: true 2025-05-07T19:43:01.7558995Z repository: pytorch/FBGEMM 2025-05-07T19:43:01.7559438Z token: *** 2025-05-07T19:43:01.7559634Z ssh-strict: true 2025-05-07T19:43:01.7559854Z ssh-user: git 2025-05-07T19:43:01.7560066Z persist-credentials: true 2025-05-07T19:43:01.7560318Z clean: true 2025-05-07T19:43:01.7560529Z sparse-checkout-cone-mode: true 2025-05-07T19:43:01.7560799Z fetch-depth: 1 2025-05-07T19:43:01.7560995Z fetch-tags: false 2025-05-07T19:43:01.7561217Z show-progress: true 2025-05-07T19:43:01.7561424Z lfs: false 2025-05-07T19:43:01.7561632Z set-safe-directory: true 2025-05-07T19:43:01.7562077Z env: 2025-05-07T19:43:01.7562290Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.7562586Z BUILD_ENV: build_binary 2025-05-07T19:43:01.7562812Z BUILD_TARGET: default 2025-05-07T19:43:01.7563087Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.7563351Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:01.7563600Z ##[endgroup] 2025-05-07T19:43:01.7603698Z ##[command]/usr/bin/docker exec bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:43:02.0754354Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:43:02.0755828Z ##[group]Getting Git version info 2025-05-07T19:43:02.0756212Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:43:02.0756784Z [command]/usr/bin/git version 2025-05-07T19:43:02.0757072Z git version 2.47.1 2025-05-07T19:43:02.0758071Z ##[endgroup] 2025-05-07T19:43:02.0762060Z Temporarily overriding HOME='/__w/_temp/b92aba3f-09b5-43e1-8f2e-822c7289064d' before making global git config changes 2025-05-07T19:43:02.0762881Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:43:02.0763564Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:43:02.0792070Z [command]/usr/bin/git 
config --local --get remote.origin.url 2025-05-07T19:43:02.0812536Z https://github.com/pytorch/FBGEMM 2025-05-07T19:43:02.0824454Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:43:02.0827082Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:43:02.0847672Z HEAD 2025-05-07T19:43:02.0881395Z ##[endgroup] 2025-05-07T19:43:02.0882123Z [command]/usr/bin/git submodule status 2025-05-07T19:43:02.1210788Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:43:02.1283272Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (4a61bdd) 2025-05-07T19:43:02.1353675Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:43:02.1423268Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (3ed8d2e) 2025-05-07T19:43:02.1487809Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (f8d7d77) 2025-05-07T19:43:02.1546178Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (4200844) 2025-05-07T19:43:02.1605020Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (9cca280) 2025-05-07T19:43:02.1611690Z ##[group]Cleaning the repository 2025-05-07T19:43:02.1612631Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:43:02.4691212Z Removing build_only/ 2025-05-07T19:43:02.4692038Z Removing collect_env.py 2025-05-07T19:43:02.4692820Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:43:02.4693747Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:43:02.4694801Z Removing fbgemm_gpu/dist/ 2025-05-07T19:43:02.4695320Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:43:02.4695721Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:43:02.4696827Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:43:02.5765776Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:02.5767029Z ##[endgroup] 2025-05-07T19:43:02.5768794Z 
##[group]Disabling automatic garbage collection 2025-05-07T19:43:02.5774878Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:43:02.5804369Z ##[endgroup] 2025-05-07T19:43:02.5804824Z ##[group]Setting up auth 2025-05-07T19:43:02.5805828Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:43:02.5829294Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:43:02.6100400Z Entering 'external/asmjit' 2025-05-07T19:43:02.6143937Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.6201445Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.6266847Z Entering 'external/cutlass' 2025-05-07T19:43:02.6329427Z Entering 'external/googletest' 2025-05-07T19:43:02.6378258Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.6428103Z Entering 'external/json' 2025-05-07T19:43:02.6503350Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:43:02.6527767Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:43:02.6804142Z Entering 'external/asmjit' 2025-05-07T19:43:02.6850419Z Entering 'external/composable_kernel' 2025-05-07T19:43:02.6902303Z Entering 'external/cpuinfo' 2025-05-07T19:43:02.6968164Z Entering 'external/cutlass' 2025-05-07T19:43:02.7043490Z Entering 'external/googletest' 2025-05-07T19:43:02.7113299Z Entering 'external/hipify_torch' 2025-05-07T19:43:02.7173136Z Entering 'external/json' 2025-05-07T19:43:02.7239307Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:02.7281452Z ##[endgroup] 2025-05-07T19:43:02.7281875Z ##[group]Fetching the repository 
2025-05-07T19:43:02.7292619Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:43:02.8974201Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:43:02.8975327Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:43:02.8987588Z ##[endgroup] 2025-05-07T19:43:02.8987990Z ##[group]Determining the checkout info 2025-05-07T19:43:02.9017914Z ##[endgroup] 2025-05-07T19:43:02.9018820Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:43:02.9496058Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:43:02.9521412Z ##[group]Checking out the ref 2025-05-07T19:43:02.9521891Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:43:03.0496841Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:43:03.0497295Z any of your branches: 2025-05-07T19:43:03.0497455Z 2025-05-07T19:43:03.0497841Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:03.0498322Z 2025-05-07T19:43:03.0498533Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:43:03.0498942Z to do so with: 2025-05-07T19:43:03.0499072Z 2025-05-07T19:43:03.0499199Z git branch 1c9ad64 2025-05-07T19:43:03.0499420Z 2025-05-07T19:43:03.0499818Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:43:03.0501154Z ##[endgroup] 2025-05-07T19:43:03.0501633Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:43:03.0504987Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:43:03.0542475Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 
2025-05-07T19:43:03.0563396Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:43:03.0585726Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:43:03.0608618Z ##[endgroup] 2025-05-07T19:43:03.0609054Z ##[group]Fetching submodules 2025-05-07T19:43:03.0609353Z [command]/usr/bin/git submodule sync 2025-05-07T19:43:03.0934562Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:43:03.0935569Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:43:03.0936047Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:43:03.0936572Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:43:03.0936983Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:43:03.0937397Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:43:03.0938045Z Synchronizing submodule url for 'external/json' 2025-05-07T19:43:03.0946005Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:43:03.1716161Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:43:03.4432481Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:43:03.5363274Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:43:04.2042223Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:43:04.2423404Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:43:04.2503344Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:43:04.3578909Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:43:04.3586460Z 
[command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:43:04.3901572Z Entering 'external/asmjit' 2025-05-07T19:43:04.3926601Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.3954388Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.3984215Z Entering 'external/cutlass' 2025-05-07T19:43:04.4011369Z Entering 'external/googletest' 2025-05-07T19:43:04.4041265Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.4072344Z Entering 'external/json' 2025-05-07T19:43:04.4117745Z ##[endgroup] 2025-05-07T19:43:04.4118174Z ##[group]Persisting credentials for submodules 2025-05-07T19:43:04.4121085Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:43:04.4413641Z Entering 'external/asmjit' 2025-05-07T19:43:04.4451868Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4452924Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4481728Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.4524808Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4525787Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4568301Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.4603013Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4603573Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4642456Z Entering 'external/cutlass' 2025-05-07T19:43:04.4682367Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4683354Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4726265Z Entering 'external/googletest' 2025-05-07T19:43:04.4761316Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4761779Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4799289Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.4834400Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4834839Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4869292Z 
Entering 'external/json' 2025-05-07T19:43:04.4911134Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4912157Z url.https://github.com/.insteadof 2025-05-07T19:43:04.4967359Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:43:04.5245391Z Entering 'external/asmjit' 2025-05-07T19:43:04.5289527Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:43:04.5290972Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.5336680Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:43:04.5338234Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.5382698Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:43:04.5383221Z Entering 'external/cutlass' 2025-05-07T19:43:04.5428316Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:43:04.5430189Z Entering 'external/googletest' 2025-05-07T19:43:04.5477570Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:43:04.5479102Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.5527525Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:43:04.5528987Z Entering 'external/json' 2025-05-07T19:43:04.5572658Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:43:04.5675637Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:43:04.5940566Z Entering 'external/asmjit' 2025-05-07T19:43:04.5963510Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.5984842Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.6009969Z Entering 'external/cutlass' 2025-05-07T19:43:04.6034971Z Entering 
'external/googletest' 2025-05-07T19:43:04.6058380Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.6091387Z Entering 'external/json' 2025-05-07T19:43:04.6126792Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:43:04.6404854Z Entering 'external/asmjit' 2025-05-07T19:43:04.6429605Z Entering 'external/composable_kernel' 2025-05-07T19:43:04.6461980Z Entering 'external/cpuinfo' 2025-05-07T19:43:04.6485301Z Entering 'external/cutlass' 2025-05-07T19:43:04.6511676Z Entering 'external/googletest' 2025-05-07T19:43:04.6545929Z Entering 'external/hipify_torch' 2025-05-07T19:43:04.6577186Z Entering 'external/json' 2025-05-07T19:43:04.6621504Z ##[endgroup] 2025-05-07T19:43:04.6659885Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:43:04.6682209Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:04.6844272Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:43:04.6844648Z . $PRELUDE; print_system_info 2025-05-07T19:43:04.6845177Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:04.6845504Z env: 2025-05-07T19:43:04.6845728Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:04.6846013Z BUILD_ENV: build_binary 2025-05-07T19:43:04.6846260Z BUILD_TARGET: default 2025-05-07T19:43:04.6846476Z BUILD_VARIANT: cuda 2025-05-07T19:43:04.6846711Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:04.6846946Z ##[endgroup] 2025-05-07T19:43:05.1186776Z ################################################################################ 2025-05-07T19:43:05.1187809Z # Print System Info 2025-05-07T19:43:05.1188131Z # 2025-05-07T19:43:05.1204050Z # [2025-05-07T19:43:05.119Z] + print_system_info 2025-05-07T19:43:05.1205139Z ################################################################################ 2025-05-07T19:43:05.1205847Z 2025-05-07T19:43:05.1206315Z ################################################################################ 2025-05-07T19:43:05.1206953Z 
[INFO] Printing environment variables ... 2025-05-07T19:43:05.1207296Z + printenv 2025-05-07T19:43:05.1207415Z 2025-05-07T19:43:05.1212812Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:43:05.1213205Z BUILD_VARIANT=cuda 2025-05-07T19:43:05.1213507Z HOSTNAME=bd0f6f446662 2025-05-07T19:43:05.1213924Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_1f1553a6-b15e-4f1f-8005-fe37e20170c4 2025-05-07T19:43:05.1214584Z GITHUB_ACTION=__run_2 2025-05-07T19:43:05.1214901Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:43:05.1215174Z RUNNER_NAME=i-07585e80669af62a2 2025-05-07T19:43:05.1215474Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:43:05.1215787Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:43:05.1216080Z MACHINE_NAME_LC=x86_64 2025-05-07T19:43:05.1216330Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:43:05.1216627Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:43:05.1216929Z GITHUB_REF_TYPE=branch 2025-05-07T19:43:05.1217426Z *** 2025-05-07T19:43:05.1217634Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:43:05.1217919Z GITHUB_ACTIONS=true 2025-05-07T19:43:05.1218490Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:43:05.1219063Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:43:05.1219630Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:43:05.1219918Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:43:05.1220355Z RUNNER_OS=Linux 2025-05-07T19:43:05.1220591Z GITHUB_REF_PROTECTED=false 2025-05-07T19:43:05.1220873Z HOME=/github/home 2025-05-07T19:43:05.1221196Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:43:05.1221516Z RUNNER_ARCH=X64 2025-05-07T19:43:05.1221761Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:43:05.1221996Z BUILD_TARGET=default 2025-05-07T19:43:05.1222436Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_1f1553a6-b15e-4f1f-8005-fe37e20170c4 2025-05-07T19:43:05.1223088Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_1f1553a6-b15e-4f1f-8005-fe37e20170c4 
2025-05-07T19:43:05.1223603Z GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:43:05.1223937Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:43:05.1224234Z GITHUB_RUN_ID=14891846252 2025-05-07T19:43:05.1224705Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_1f1553a6-b15e-4f1f-8005-fe37e20170c4 2025-05-07T19:43:05.1225241Z BUILD_ENV=build_binary 2025-05-07T19:43:05.1225493Z GITHUB_ACTOR=q10 2025-05-07T19:43:05.1225716Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:43:05.1225965Z KERN_NAME_LC=linux 2025-05-07T19:43:05.1226194Z BUILD_CUDA_VERSION=12.6.3 2025-05-07T19:43:05.1226517Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:43:05.1226869Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:43:05.1227167Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:43:05.1227451Z SHLVL=1 2025-05-07T19:43:05.1227671Z GITHUB_ACTOR_ID=255046 2025-05-07T19:43:05.1227920Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:43:05.1232431Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:43:05.1232935Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:43:05.1233185Z KERN_NAME=Linux 2025-05-07T19:43:05.1233431Z GITHUB_JOB=build_artifact 2025-05-07T19:43:05.1233713Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:43:05.1234017Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:43:05.1234273Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:43:05.1234558Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:43:05.1234915Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:43:05.1235317Z GITHUB_BASE_REF=main 2025-05-07T19:43:05.1235543Z CI=true 2025-05-07T19:43:05.1235770Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:43:05.1236062Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:43:05.1236369Z GITHUB_ACTION_REF= 2025-05-07T19:43:05.1236634Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:43:05.1237133Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_1f1553a6-b15e-4f1f-8005-fe37e20170c4 2025-05-07T19:43:05.1237645Z MACHINE_NAME=x86_64 
2025-05-07T19:43:05.1237882Z _=/usr/bin/printenv 2025-05-07T19:43:05.1238044Z 2025-05-07T19:43:05.1238168Z ################################################################################ 2025-05-07T19:43:05.1238508Z [INFO] Print ldd version ... 2025-05-07T19:43:05.1238786Z + ldd --version 2025-05-07T19:43:05.1238920Z 2025-05-07T19:43:05.1239062Z ldd (GNU libc) 2.34 2025-05-07T19:43:05.1239338Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:43:05.1239817Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:43:05.1240382Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:43:05.1240878Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:43:05.1241110Z 2025-05-07T19:43:05.1241232Z ################################################################################ 2025-05-07T19:43:05.1241578Z [INFO] Print CPU info ... 2025-05-07T19:43:05.1241841Z + nproc 2025-05-07T19:43:05.1241956Z 2025-05-07T19:43:05.1242053Z 96 2025-05-07T19:43:05.1242164Z 2025-05-07T19:43:05.1242678Z + lscpu 2025-05-07T19:43:05.1242847Z 2025-05-07T19:43:05.1503670Z Architecture: x86_64 2025-05-07T19:43:05.1505369Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:43:05.1506021Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1506455Z Byte Order: Little Endian 2025-05-07T19:43:05.1506846Z CPU(s): 96 2025-05-07T19:43:05.1507331Z On-line CPU(s) list: 0-95 2025-05-07T19:43:05.1507732Z Vendor ID: GenuineIntel 2025-05-07T19:43:05.1508138Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1508554Z CPU family: 6 2025-05-07T19:43:05.1508858Z Model: 85 2025-05-07T19:43:05.1509181Z Thread(s) per core: 2 2025-05-07T19:43:05.1509545Z Core(s) per socket: 24 2025-05-07T19:43:05.1509841Z Socket(s): 2 2025-05-07T19:43:05.1510152Z Stepping: 7 2025-05-07T19:43:05.1510467Z BogoMIPS: 6000.01 2025-05-07T19:43:05.1512966Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr 
pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.1515594Z Hypervisor vendor: KVM 2025-05-07T19:43:05.1516119Z Virtualization type: full 2025-05-07T19:43:05.1516485Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:43:05.1516953Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:43:05.1517348Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:43:05.1517738Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:43:05.1518112Z NUMA node(s): 2 2025-05-07T19:43:05.1518429Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:43:05.1518797Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:43:05.1519294Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:43:05.1519877Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:43:05.1520443Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:43:05.1521069Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:05.1521673Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:43:05.1522320Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:43:05.1522967Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:43:05.1523392Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:43:05.1523785Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:43:05.1524192Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:43:05.1524759Z 
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:43:05.1525618Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:43:05.1526308Z Vulnerability Srbds: Not affected 2025-05-07T19:43:05.1526701Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:43:05.1526948Z 2025-05-07T19:43:05.1527071Z + cat /proc/cpuinfo 2025-05-07T19:43:05.1527213Z 2025-05-07T19:43:05.1527694Z processor : 0 2025-05-07T19:43:05.1528024Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1528295Z cpu family : 6 2025-05-07T19:43:05.1528507Z model : 85 2025-05-07T19:43:05.1528817Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1529174Z stepping : 7 2025-05-07T19:43:05.1529422Z microcode : 0x5003901 2025-05-07T19:43:05.1529666Z cpu MHz : 1268.059 2025-05-07T19:43:05.1529904Z cache size : 36608 KB 2025-05-07T19:43:05.1530147Z physical id : 0 2025-05-07T19:43:05.1530379Z siblings : 48 2025-05-07T19:43:05.1530592Z core id : 0 2025-05-07T19:43:05.1530814Z cpu cores : 24 2025-05-07T19:43:05.1531048Z apicid : 0 2025-05-07T19:43:05.1531255Z initial apicid : 0 2025-05-07T19:43:05.1531490Z fpu : yes 2025-05-07T19:43:05.1531718Z fpu_exception : yes 2025-05-07T19:43:05.1531969Z cpuid level : 13 2025-05-07T19:43:05.1532185Z wp : yes 2025-05-07T19:43:05.1534496Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku 
ospke avx512_vnni 2025-05-07T19:43:05.1537187Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1537794Z bogomips : 6000.01 2025-05-07T19:43:05.1538058Z clflush size : 64 2025-05-07T19:43:05.1538289Z cache_alignment : 64 2025-05-07T19:43:05.1538603Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1539021Z power management: 2025-05-07T19:43:05.1539205Z 2025-05-07T19:43:05.1539297Z processor : 1 2025-05-07T19:43:05.1539529Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1539819Z cpu family : 6 2025-05-07T19:43:05.1540048Z model : 85 2025-05-07T19:43:05.1540454Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1540835Z stepping : 7 2025-05-07T19:43:05.1541137Z microcode : 0x5003901 2025-05-07T19:43:05.1541382Z cpu MHz : 3294.242 2025-05-07T19:43:05.1541602Z cache size : 36608 KB 2025-05-07T19:43:05.1541846Z physical id : 0 2025-05-07T19:43:05.1542069Z siblings : 48 2025-05-07T19:43:05.1542312Z core id : 1 2025-05-07T19:43:05.1542525Z cpu cores : 24 2025-05-07T19:43:05.1542764Z apicid : 2 2025-05-07T19:43:05.1542964Z initial apicid : 2 2025-05-07T19:43:05.1543199Z fpu : yes 2025-05-07T19:43:05.1543428Z fpu_exception : yes 2025-05-07T19:43:05.1543666Z cpuid level : 13 2025-05-07T19:43:05.1543901Z wp : yes 2025-05-07T19:43:05.1546184Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:43:05.1548848Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1549449Z bogomips : 6000.01 2025-05-07T19:43:05.1549675Z clflush size : 64 2025-05-07T19:43:05.1549915Z cache_alignment : 64 2025-05-07T19:43:05.1550192Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1550540Z power management: 2025-05-07T19:43:05.1550681Z 2025-05-07T19:43:05.1550771Z processor : 2 2025-05-07T19:43:05.1551010Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1551275Z cpu family : 6 2025-05-07T19:43:05.1551577Z model : 85 2025-05-07T19:43:05.1552054Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1552404Z stepping : 7 2025-05-07T19:43:05.1552626Z microcode : 0x5003901 2025-05-07T19:43:05.1552854Z cpu MHz : 3000.006 2025-05-07T19:43:05.1553087Z cache size : 36608 KB 2025-05-07T19:43:05.1553309Z physical id : 0 2025-05-07T19:43:05.1553531Z siblings : 48 2025-05-07T19:43:05.1553729Z core id : 2 2025-05-07T19:43:05.1553936Z cpu cores : 24 2025-05-07T19:43:05.1554133Z apicid : 4 2025-05-07T19:43:05.1554341Z initial apicid : 4 2025-05-07T19:43:05.1554567Z fpu : yes 2025-05-07T19:43:05.1554762Z fpu_exception : yes 2025-05-07T19:43:05.1554989Z cpuid level : 13 2025-05-07T19:43:05.1555192Z wp : yes 2025-05-07T19:43:05.1557497Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1560131Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1560712Z bogomips : 6000.01 2025-05-07T19:43:05.1560938Z clflush size : 64 2025-05-07T19:43:05.1561151Z cache_alignment : 64 2025-05-07T19:43:05.1561436Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1561757Z power management: 2025-05-07T19:43:05.1561902Z 2025-05-07T19:43:05.1561985Z processor : 3 2025-05-07T19:43:05.1562270Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1562506Z cpu family : 6 2025-05-07T19:43:05.1562717Z model : 85 2025-05-07T19:43:05.1562984Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1563351Z stepping : 7 2025-05-07T19:43:05.1563553Z microcode : 0x5003901 2025-05-07T19:43:05.1563791Z cpu MHz : 3000.006 2025-05-07T19:43:05.1564005Z cache size : 36608 KB 2025-05-07T19:43:05.1564244Z physical id : 0 2025-05-07T19:43:05.1564452Z siblings : 48 2025-05-07T19:43:05.1564669Z core id : 3 2025-05-07T19:43:05.1564868Z cpu cores : 24 2025-05-07T19:43:05.1565089Z apicid : 6 2025-05-07T19:43:05.1565283Z initial apicid : 6 2025-05-07T19:43:05.1565506Z fpu : yes 2025-05-07T19:43:05.1565713Z fpu_exception : yes 2025-05-07T19:43:05.1565928Z cpuid level : 13 2025-05-07T19:43:05.1566145Z wp : yes 2025-05-07T19:43:05.1568410Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1571103Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1571698Z bogomips : 6000.01 2025-05-07T19:43:05.1571914Z clflush size : 64 2025-05-07T19:43:05.1572148Z cache_alignment : 64 2025-05-07T19:43:05.1572423Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1572763Z power management: 2025-05-07T19:43:05.1572897Z 2025-05-07T19:43:05.1572983Z processor : 4 2025-05-07T19:43:05.1573211Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1573472Z cpu family : 6 2025-05-07T19:43:05.1573670Z model : 85 2025-05-07T19:43:05.1573956Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1574409Z stepping : 7 2025-05-07T19:43:05.1574633Z microcode : 0x5003901 2025-05-07T19:43:05.1574860Z cpu MHz : 3000.006 2025-05-07T19:43:05.1575089Z cache size : 36608 KB 2025-05-07T19:43:05.1575312Z physical id : 0 2025-05-07T19:43:05.1575535Z siblings : 48 2025-05-07T19:43:05.1575733Z core id : 4 2025-05-07T19:43:05.1576130Z cpu cores : 24 2025-05-07T19:43:05.1576332Z apicid : 8 2025-05-07T19:43:05.1576544Z initial apicid : 8 2025-05-07T19:43:05.1576879Z fpu : yes 2025-05-07T19:43:05.1577131Z fpu_exception : yes 2025-05-07T19:43:05.1577361Z cpuid level : 13 2025-05-07T19:43:05.1577566Z wp : yes 2025-05-07T19:43:05.1579835Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1582549Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1583133Z bogomips : 6000.01 2025-05-07T19:43:05.1583358Z clflush size : 64 2025-05-07T19:43:05.1583571Z cache_alignment : 64 2025-05-07T19:43:05.1583853Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1584174Z power management: 2025-05-07T19:43:05.1584320Z 2025-05-07T19:43:05.1584405Z processor : 5 2025-05-07T19:43:05.1584627Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1584859Z cpu family : 6 2025-05-07T19:43:05.1585190Z model : 85 2025-05-07T19:43:05.1585507Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1585868Z stepping : 7 2025-05-07T19:43:05.1586077Z microcode : 0x5003901 2025-05-07T19:43:05.1586313Z cpu MHz : 3000.006 2025-05-07T19:43:05.1586530Z cache size : 36608 KB 2025-05-07T19:43:05.1586764Z physical id : 0 2025-05-07T19:43:05.1586971Z siblings : 48 2025-05-07T19:43:05.1587184Z core id : 5 2025-05-07T19:43:05.1587385Z cpu cores : 24 2025-05-07T19:43:05.1587601Z apicid : 10 2025-05-07T19:43:05.1587813Z initial apicid : 10 2025-05-07T19:43:05.1588021Z fpu : yes 2025-05-07T19:43:05.1588229Z fpu_exception : yes 2025-05-07T19:43:05.1588443Z cpuid level : 13 2025-05-07T19:43:05.1588658Z wp : yes 2025-05-07T19:43:05.1590929Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1593648Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1594249Z bogomips : 6000.01 2025-05-07T19:43:05.1594464Z clflush size : 64 2025-05-07T19:43:05.1594693Z cache_alignment : 64 2025-05-07T19:43:05.1594961Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1595298Z power management: 2025-05-07T19:43:05.1595430Z 2025-05-07T19:43:05.1595528Z processor : 6 2025-05-07T19:43:05.1595742Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1595991Z cpu family : 6 2025-05-07T19:43:05.1596195Z model : 85 2025-05-07T19:43:05.1596482Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1596831Z stepping : 7 2025-05-07T19:43:05.1597051Z microcode : 0x5003901 2025-05-07T19:43:05.1597366Z cpu MHz : 3000.006 2025-05-07T19:43:05.1597597Z cache size : 36608 KB 2025-05-07T19:43:05.1597819Z physical id : 0 2025-05-07T19:43:05.1598048Z siblings : 48 2025-05-07T19:43:05.1598249Z core id : 6 2025-05-07T19:43:05.1598466Z cpu cores : 24 2025-05-07T19:43:05.1598687Z apicid : 12 2025-05-07T19:43:05.1598885Z initial apicid : 12 2025-05-07T19:43:05.1599112Z fpu : yes 2025-05-07T19:43:05.1599312Z fpu_exception : yes 2025-05-07T19:43:05.1599540Z cpuid level : 13 2025-05-07T19:43:05.1599745Z wp : yes 2025-05-07T19:43:05.1602042Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1604922Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1605503Z bogomips : 6000.01 2025-05-07T19:43:05.1605730Z clflush size : 64 2025-05-07T19:43:05.1605946Z cache_alignment : 64 2025-05-07T19:43:05.1606230Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1606553Z power management: 2025-05-07T19:43:05.1606705Z 2025-05-07T19:43:05.1606791Z processor : 7 2025-05-07T19:43:05.1607016Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1607252Z cpu family : 6 2025-05-07T19:43:05.1607465Z model : 85 2025-05-07T19:43:05.1607736Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1609339Z stepping : 7 2025-05-07T19:43:05.1609561Z microcode : 0x5003901 2025-05-07T19:43:05.1609815Z cpu MHz : 3000.006 2025-05-07T19:43:05.1610037Z cache size : 36608 KB 2025-05-07T19:43:05.1610278Z physical id : 0 2025-05-07T19:43:05.1610489Z siblings : 48 2025-05-07T19:43:05.1610705Z core id : 7 2025-05-07T19:43:05.1610921Z cpu cores : 24 2025-05-07T19:43:05.1611126Z apicid : 14 2025-05-07T19:43:05.1611354Z initial apicid : 14 2025-05-07T19:43:05.1611572Z fpu : yes 2025-05-07T19:43:05.1611791Z fpu_exception : yes 2025-05-07T19:43:05.1612009Z cpuid level : 13 2025-05-07T19:43:05.1612350Z wp : yes 2025-05-07T19:43:05.1614549Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1617235Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1617795Z bogomips : 6000.01 2025-05-07T19:43:05.1618000Z clflush size : 64 2025-05-07T19:43:05.1618220Z cache_alignment : 64 2025-05-07T19:43:05.1618473Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1618791Z power management: 2025-05-07T19:43:05.1618916Z 2025-05-07T19:43:05.1619015Z processor : 8 2025-05-07T19:43:05.1619213Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1619447Z cpu family : 6 2025-05-07T19:43:05.1619638Z model : 85 2025-05-07T19:43:05.1619912Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1620311Z stepping : 7 2025-05-07T19:43:05.1620692Z microcode : 0x5003901 2025-05-07T19:43:05.1620915Z cpu MHz : 3000.006 2025-05-07T19:43:05.1621143Z cache size : 36608 KB 2025-05-07T19:43:05.1621447Z physical id : 0 2025-05-07T19:43:05.1621745Z siblings : 48 2025-05-07T19:43:05.1621940Z core id : 8 2025-05-07T19:43:05.1622149Z cpu cores : 24 2025-05-07T19:43:05.1622363Z apicid : 16 2025-05-07T19:43:05.1622561Z initial apicid : 16 2025-05-07T19:43:05.1622786Z fpu : yes 2025-05-07T19:43:05.1622982Z fpu_exception : yes 2025-05-07T19:43:05.1623211Z cpuid level : 13 2025-05-07T19:43:05.1623416Z wp : yes 2025-05-07T19:43:05.1625692Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1628336Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1628919Z bogomips : 6000.01 2025-05-07T19:43:05.1629148Z clflush size : 64 2025-05-07T19:43:05.1629361Z cache_alignment : 64 2025-05-07T19:43:05.1629643Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1629964Z power management: 2025-05-07T19:43:05.1630109Z 2025-05-07T19:43:05.1630193Z processor : 9 2025-05-07T19:43:05.1630418Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1630654Z cpu family : 6 2025-05-07T19:43:05.1630865Z model : 85 2025-05-07T19:43:05.1631134Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1631494Z stepping : 7 2025-05-07T19:43:05.1631700Z microcode : 0x5003901 2025-05-07T19:43:05.1631998Z cpu MHz : 3000.006 2025-05-07T19:43:05.1632213Z cache size : 36608 KB 2025-05-07T19:43:05.1632449Z physical id : 0 2025-05-07T19:43:05.1632655Z siblings : 48 2025-05-07T19:43:05.1632980Z core id : 9 2025-05-07T19:43:05.1633184Z cpu cores : 24 2025-05-07T19:43:05.1633377Z apicid : 18 2025-05-07T19:43:05.1633584Z initial apicid : 18 2025-05-07T19:43:05.1633784Z fpu : yes 2025-05-07T19:43:05.1633979Z fpu_exception : yes 2025-05-07T19:43:05.1634179Z cpuid level : 13 2025-05-07T19:43:05.1634380Z wp : yes 2025-05-07T19:43:05.1636467Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1638894Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1639449Z bogomips : 6000.01 2025-05-07T19:43:05.1639650Z clflush size : 64 2025-05-07T19:43:05.1639866Z cache_alignment : 64 2025-05-07T19:43:05.1640118Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1640431Z power management: 2025-05-07T19:43:05.1640554Z 2025-05-07T19:43:05.1640649Z processor : 10 2025-05-07T19:43:05.1640846Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1641076Z cpu family : 6 2025-05-07T19:43:05.1641437Z model : 85 2025-05-07T19:43:05.1641757Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1642098Z stepping : 7 2025-05-07T19:43:05.1642307Z microcode : 0x5003901 2025-05-07T19:43:05.1642523Z cpu MHz : 3000.006 2025-05-07T19:43:05.1642745Z cache size : 36608 KB 2025-05-07T19:43:05.1643180Z physical id : 0 2025-05-07T19:43:05.1643398Z siblings : 48 2025-05-07T19:43:05.1643593Z core id : 10 2025-05-07T19:43:05.1643872Z cpu cores : 24 2025-05-07T19:43:05.1644087Z apicid : 20 2025-05-07T19:43:05.1644287Z initial apicid : 20 2025-05-07T19:43:05.1644531Z fpu : yes 2025-05-07T19:43:05.1644727Z fpu_exception : yes 2025-05-07T19:43:05.1644965Z cpuid level : 13 2025-05-07T19:43:05.1645169Z wp : yes 2025-05-07T19:43:05.1647444Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1650077Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1650659Z bogomips : 6000.01 2025-05-07T19:43:05.1650882Z clflush size : 64 2025-05-07T19:43:05.1651265Z cache_alignment : 64 2025-05-07T19:43:05.1651569Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1651906Z power management: 2025-05-07T19:43:05.1652070Z 2025-05-07T19:43:05.1652164Z processor : 11 2025-05-07T19:43:05.1652416Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1652667Z cpu family : 6 2025-05-07T19:43:05.1652910Z model : 85 2025-05-07T19:43:05.1653198Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1653565Z stepping : 7 2025-05-07T19:43:05.1653771Z microcode : 0x5003901 2025-05-07T19:43:05.1654023Z cpu MHz : 3000.006 2025-05-07T19:43:05.1654263Z cache size : 36608 KB 2025-05-07T19:43:05.1654587Z physical id : 0 2025-05-07T19:43:05.1654796Z siblings : 48 2025-05-07T19:43:05.1655016Z core id : 11 2025-05-07T19:43:05.1655234Z cpu cores : 24 2025-05-07T19:43:05.1655442Z apicid : 22 2025-05-07T19:43:05.1655681Z initial apicid : 22 2025-05-07T19:43:05.1655912Z fpu : yes 2025-05-07T19:43:05.1656161Z fpu_exception : yes 2025-05-07T19:43:05.1656402Z cpuid level : 13 2025-05-07T19:43:05.1656654Z wp : yes 2025-05-07T19:43:05.1658937Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1661702Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1662304Z bogomips : 6000.01 2025-05-07T19:43:05.1662530Z clflush size : 64 2025-05-07T19:43:05.1662769Z cache_alignment : 64 2025-05-07T19:43:05.1663043Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1663382Z power management: 2025-05-07T19:43:05.1663515Z 2025-05-07T19:43:05.1663615Z processor : 12 2025-05-07T19:43:05.1663832Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1664078Z cpu family : 6 2025-05-07T19:43:05.1664275Z model : 85 2025-05-07T19:43:05.1664557Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1664902Z stepping : 7 2025-05-07T19:43:05.1665115Z microcode : 0x5003901 2025-05-07T19:43:05.1665335Z cpu MHz : 1199.461 2025-05-07T19:43:05.1665582Z cache size : 36608 KB 2025-05-07T19:43:05.1665804Z physical id : 0 2025-05-07T19:43:05.1666047Z siblings : 48 2025-05-07T19:43:05.1666258Z core id : 12 2025-05-07T19:43:05.1666496Z cpu cores : 24 2025-05-07T19:43:05.1666730Z apicid : 24 2025-05-07T19:43:05.1667019Z initial apicid : 24 2025-05-07T19:43:05.1667267Z fpu : yes 2025-05-07T19:43:05.1667484Z fpu_exception : yes 2025-05-07T19:43:05.1667739Z cpuid level : 13 2025-05-07T19:43:05.1667964Z wp : yes 2025-05-07T19:43:05.1670262Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1672924Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1673529Z bogomips : 6000.01 2025-05-07T19:43:05.1673786Z clflush size : 64 2025-05-07T19:43:05.1674017Z cache_alignment : 64 2025-05-07T19:43:05.1674324Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1674662Z power management: 2025-05-07T19:43:05.1674827Z 2025-05-07T19:43:05.1674919Z processor : 13 2025-05-07T19:43:05.1675168Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1675421Z cpu family : 6 2025-05-07T19:43:05.1675663Z model : 85 2025-05-07T19:43:05.1676086Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1676484Z stepping : 7 2025-05-07T19:43:05.1676784Z microcode : 0x5003901 2025-05-07T19:43:05.1677119Z cpu MHz : 3000.006 2025-05-07T19:43:05.1677356Z cache size : 36608 KB 2025-05-07T19:43:05.1677626Z physical id : 0 2025-05-07T19:43:05.1677851Z siblings : 48 2025-05-07T19:43:05.1678185Z core id : 13 2025-05-07T19:43:05.1678430Z cpu cores : 24 2025-05-07T19:43:05.1678646Z apicid : 26 2025-05-07T19:43:05.1678893Z initial apicid : 26 2025-05-07T19:43:05.1679127Z fpu : yes 2025-05-07T19:43:05.1679364Z fpu_exception : yes 2025-05-07T19:43:05.1679598Z cpuid level : 13 2025-05-07T19:43:05.1679849Z wp : yes 2025-05-07T19:43:05.1682132Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1684793Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1685413Z bogomips : 6000.01 2025-05-07T19:43:05.1685640Z clflush size : 64 2025-05-07T19:43:05.1685896Z cache_alignment : 64 2025-05-07T19:43:05.1686180Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1686545Z power management: 2025-05-07T19:43:05.1686683Z 2025-05-07T19:43:05.1686795Z processor : 14 2025-05-07T19:43:05.1687025Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1687291Z cpu family : 6 2025-05-07T19:43:05.1687511Z model : 85 2025-05-07T19:43:05.1687815Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1688188Z stepping : 7 2025-05-07T19:43:05.1688426Z microcode : 0x5003901 2025-05-07T19:43:05.1688662Z cpu MHz : 3000.006 2025-05-07T19:43:05.1688912Z cache size : 36608 KB 2025-05-07T19:43:05.1689150Z physical id : 0 2025-05-07T19:43:05.1689396Z siblings : 48 2025-05-07T19:43:05.1689610Z core id : 14 2025-05-07T19:43:05.1689853Z cpu cores : 24 2025-05-07T19:43:05.1690099Z apicid : 28 2025-05-07T19:43:05.1690321Z initial apicid : 28 2025-05-07T19:43:05.1690586Z fpu : yes 2025-05-07T19:43:05.1690800Z fpu_exception : yes 2025-05-07T19:43:05.1691143Z cpuid level : 13 2025-05-07T19:43:05.1691365Z wp : yes 2025-05-07T19:43:05.1693658Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1696332Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1696933Z bogomips : 6000.01 2025-05-07T19:43:05.1697183Z clflush size : 64 2025-05-07T19:43:05.1697416Z cache_alignment : 64 2025-05-07T19:43:05.1697739Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1698134Z power management: 2025-05-07T19:43:05.1698286Z 2025-05-07T19:43:05.1698409Z processor : 15 2025-05-07T19:43:05.1698675Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1698966Z cpu family : 6 2025-05-07T19:43:05.1699192Z model : 85 2025-05-07T19:43:05.1699503Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1699873Z stepping : 7 2025-05-07T19:43:05.1700196Z microcode : 0x5003901 2025-05-07T19:43:05.1700443Z cpu MHz : 3000.006 2025-05-07T19:43:05.1700710Z cache size : 36608 KB 2025-05-07T19:43:05.1700957Z physical id : 0 2025-05-07T19:43:05.1701251Z siblings : 48 2025-05-07T19:43:05.1701499Z core id : 15 2025-05-07T19:43:05.1701719Z cpu cores : 24 2025-05-07T19:43:05.1701960Z apicid : 30 2025-05-07T19:43:05.1702254Z initial apicid : 30 2025-05-07T19:43:05.1702510Z fpu : yes 2025-05-07T19:43:05.1702731Z fpu_exception : yes 2025-05-07T19:43:05.1702994Z cpuid level : 13 2025-05-07T19:43:05.1703228Z wp : yes 2025-05-07T19:43:05.1705541Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1708208Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1708813Z bogomips : 6000.01 2025-05-07T19:43:05.1709061Z clflush size : 64 2025-05-07T19:43:05.1709290Z cache_alignment : 64 2025-05-07T19:43:05.1709593Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1709952Z power management: 2025-05-07T19:43:05.1710090Z 2025-05-07T19:43:05.1710182Z processor : 16 2025-05-07T19:43:05.1710431Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1710678Z cpu family : 6 2025-05-07T19:43:05.1710914Z model : 85 2025-05-07T19:43:05.1711200Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1711583Z stepping : 7 2025-05-07T19:43:05.1711807Z microcode : 0x5003901 2025-05-07T19:43:05.1712067Z cpu MHz : 3000.006 2025-05-07T19:43:05.1712298Z cache size : 36608 KB 2025-05-07T19:43:05.1712553Z physical id : 0 2025-05-07T19:43:05.1712776Z siblings : 48 2025-05-07T19:43:05.1712973Z core id : 16 2025-05-07T19:43:05.1713188Z cpu cores : 24 2025-05-07T19:43:05.1713386Z apicid : 32 2025-05-07T19:43:05.1713598Z initial apicid : 32 2025-05-07T19:43:05.1713809Z fpu : yes 2025-05-07T19:43:05.1714017Z fpu_exception : yes 2025-05-07T19:43:05.1714332Z cpuid level : 13 2025-05-07T19:43:05.1714624Z wp : yes 2025-05-07T19:43:05.1716946Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1719575Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1720175Z bogomips : 6000.01 2025-05-07T19:43:05.1720396Z clflush size : 64 2025-05-07T19:43:05.1720631Z cache_alignment : 64 2025-05-07T19:43:05.1720903Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1721248Z power management: 2025-05-07T19:43:05.1721380Z 2025-05-07T19:43:05.1721477Z processor : 17 2025-05-07T19:43:05.1721690Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1721939Z cpu family : 6 2025-05-07T19:43:05.1722137Z model : 85 2025-05-07T19:43:05.1722419Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1722764Z stepping : 7 2025-05-07T19:43:05.1722981Z microcode : 0x5003901 2025-05-07T19:43:05.1723203Z cpu MHz : 3000.006 2025-05-07T19:43:05.1723428Z cache size : 36608 KB 2025-05-07T19:43:05.1723650Z physical id : 0 2025-05-07T19:43:05.1723870Z siblings : 48 2025-05-07T19:43:05.1724081Z core id : 17 2025-05-07T19:43:05.1724278Z cpu cores : 24 2025-05-07T19:43:05.1724492Z apicid : 34 2025-05-07T19:43:05.1724692Z initial apicid : 34 2025-05-07T19:43:05.1724914Z fpu : yes 2025-05-07T19:43:05.1725178Z fpu_exception : yes 2025-05-07T19:43:05.1725411Z cpuid level : 13 2025-05-07T19:43:05.1725612Z wp : yes 2025-05-07T19:43:05.1727951Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1730904Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1731505Z bogomips : 6000.01 2025-05-07T19:43:05.1731731Z clflush size : 64 2025-05-07T19:43:05.1731948Z cache_alignment : 64 2025-05-07T19:43:05.1732227Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1732560Z power management: 2025-05-07T19:43:05.1732696Z 2025-05-07T19:43:05.1732780Z processor : 18 2025-05-07T19:43:05.1733004Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1733237Z cpu family : 6 2025-05-07T19:43:05.1733449Z model : 85 2025-05-07T19:43:05.1733716Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1734078Z stepping : 7 2025-05-07T19:43:05.1734287Z microcode : 0x5003901 2025-05-07T19:43:05.1734527Z cpu MHz : 3000.006 2025-05-07T19:43:05.1734740Z cache size : 36608 KB 2025-05-07T19:43:05.1734974Z physical id : 0 2025-05-07T19:43:05.1735189Z siblings : 48 2025-05-07T19:43:05.1735389Z core id : 18 2025-05-07T19:43:05.1735621Z cpu cores : 24 2025-05-07T19:43:05.1735836Z apicid : 36 2025-05-07T19:43:05.1736073Z initial apicid : 36 2025-05-07T19:43:05.1736297Z fpu : yes 2025-05-07T19:43:05.1736535Z fpu_exception : yes 2025-05-07T19:43:05.1736767Z cpuid level : 13 2025-05-07T19:43:05.1737012Z wp : yes 2025-05-07T19:43:05.1739291Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1742100Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1742731Z bogomips : 6000.01 2025-05-07T19:43:05.1742961Z clflush size : 64 2025-05-07T19:43:05.1743226Z cache_alignment : 64 2025-05-07T19:43:05.1743546Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1743892Z power management: 2025-05-07T19:43:05.1744038Z 2025-05-07T19:43:05.1744156Z processor : 19 2025-05-07T19:43:05.1744400Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1763959Z cpu family : 6 2025-05-07T19:43:05.1764250Z model : 85 2025-05-07T19:43:05.1764552Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1764908Z stepping : 7 2025-05-07T19:43:05.1765115Z microcode : 0x5003901 2025-05-07T19:43:05.1765334Z cpu MHz : 3000.006 2025-05-07T19:43:05.1765531Z cache size : 36608 KB 2025-05-07T19:43:05.1765751Z physical id : 0 2025-05-07T19:43:05.1765943Z siblings : 48 2025-05-07T19:43:05.1766138Z core id : 19 2025-05-07T19:43:05.1766315Z cpu cores : 24 2025-05-07T19:43:05.1766511Z apicid : 38 2025-05-07T19:43:05.1766695Z initial apicid : 38 2025-05-07T19:43:05.1766902Z fpu : yes 2025-05-07T19:43:05.1767082Z fpu_exception : yes 2025-05-07T19:43:05.1767291Z cpuid level : 13 2025-05-07T19:43:05.1767480Z wp : yes 2025-05-07T19:43:05.1769725Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1772177Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1772730Z bogomips : 6000.01 2025-05-07T19:43:05.1772924Z clflush size : 64 2025-05-07T19:43:05.1773137Z cache_alignment : 64 2025-05-07T19:43:05.1773383Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1773697Z power management: 2025-05-07T19:43:05.1773820Z 2025-05-07T19:43:05.1773898Z processor : 20 2025-05-07T19:43:05.1774096Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1774316Z cpu family : 6 2025-05-07T19:43:05.1774510Z model : 85 2025-05-07T19:43:05.1774759Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1775084Z stepping : 7 2025-05-07T19:43:05.1775282Z microcode : 0x5003901 2025-05-07T19:43:05.1775488Z cpu MHz : 3000.006 2025-05-07T19:43:05.1775693Z cache size : 36608 KB 2025-05-07T19:43:05.1776047Z physical id : 0 2025-05-07T19:43:05.1776426Z siblings : 48 2025-05-07T19:43:05.1776622Z core id : 20 2025-05-07T19:43:05.1776958Z cpu cores : 24 2025-05-07T19:43:05.1777152Z apicid : 40 2025-05-07T19:43:05.1777411Z initial apicid : 40 2025-05-07T19:43:05.1777613Z fpu : yes 2025-05-07T19:43:05.1777813Z fpu_exception : yes 2025-05-07T19:43:05.1778022Z cpuid level : 13 2025-05-07T19:43:05.1778229Z wp : yes 2025-05-07T19:43:05.1780574Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1783316Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1783901Z bogomips : 6000.01 2025-05-07T19:43:05.1784118Z clflush size : 64 2025-05-07T19:43:05.1784329Z cache_alignment : 64 2025-05-07T19:43:05.1784605Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1784924Z power management: 2025-05-07T19:43:05.1785068Z 2025-05-07T19:43:05.1785149Z processor : 21 2025-05-07T19:43:05.1785354Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1785583Z cpu family : 6 2025-05-07T19:43:05.1785770Z model : 85 2025-05-07T19:43:05.1786044Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1786385Z stepping : 7 2025-05-07T19:43:05.1786590Z microcode : 0x5003901 2025-05-07T19:43:05.1786808Z cpu MHz : 1200.057 2025-05-07T19:43:05.1787017Z cache size : 36608 KB 2025-05-07T19:43:05.1787238Z physical id : 0 2025-05-07T19:43:05.1787431Z siblings : 48 2025-05-07T19:43:05.1787628Z core id : 21 2025-05-07T19:43:05.1787822Z cpu cores : 24 2025-05-07T19:43:05.1788025Z apicid : 42 2025-05-07T19:43:05.1788218Z initial apicid : 42 2025-05-07T19:43:05.1788428Z fpu : yes 2025-05-07T19:43:05.1788615Z fpu_exception : yes 2025-05-07T19:43:05.1788833Z cpuid level : 13 2025-05-07T19:43:05.1789028Z wp : yes 2025-05-07T19:43:05.1791379Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1793980Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1794513Z bogomips : 6000.01 2025-05-07T19:43:05.1794701Z clflush size : 64 2025-05-07T19:43:05.1794900Z cache_alignment : 64 2025-05-07T19:43:05.1795142Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1795437Z power management: 2025-05-07T19:43:05.1795555Z 2025-05-07T19:43:05.1795632Z processor : 22 2025-05-07T19:43:05.1795830Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1796039Z cpu family : 6 2025-05-07T19:43:05.1796222Z model : 85 2025-05-07T19:43:05.1796467Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1796789Z stepping : 7 2025-05-07T19:43:05.1796977Z microcode : 0x5003901 2025-05-07T19:43:05.1797177Z cpu MHz : 3000.006 2025-05-07T19:43:05.1797377Z cache size : 36608 KB 2025-05-07T19:43:05.1797577Z physical id : 0 2025-05-07T19:43:05.1797765Z siblings : 48 2025-05-07T19:43:05.1797944Z core id : 22 2025-05-07T19:43:05.1798125Z cpu cores : 24 2025-05-07T19:43:05.1798301Z apicid : 44 2025-05-07T19:43:05.1798488Z initial apicid : 44 2025-05-07T19:43:05.1798675Z fpu : yes 2025-05-07T19:43:05.1798858Z fpu_exception : yes 2025-05-07T19:43:05.1799049Z cpuid level : 13 2025-05-07T19:43:05.1799242Z wp : yes 2025-05-07T19:43:05.1801325Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1803806Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1804342Z bogomips : 6000.01 2025-05-07T19:43:05.1804545Z clflush size : 64 2025-05-07T19:43:05.1804739Z cache_alignment : 64 2025-05-07T19:43:05.1804988Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1805281Z power management: 2025-05-07T19:43:05.1805407Z 2025-05-07T19:43:05.1805480Z processor : 23 2025-05-07T19:43:05.1805675Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1805894Z cpu family : 6 2025-05-07T19:43:05.1806076Z model : 85 2025-05-07T19:43:05.1806333Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1806651Z stepping : 7 2025-05-07T19:43:05.1806843Z microcode : 0x5003901 2025-05-07T19:43:05.1807051Z cpu MHz : 3000.006 2025-05-07T19:43:05.1807240Z cache size : 36608 KB 2025-05-07T19:43:05.1807448Z physical id : 0 2025-05-07T19:43:05.1807630Z siblings : 48 2025-05-07T19:43:05.1807814Z core id : 23 2025-05-07T19:43:05.1807989Z cpu cores : 24 2025-05-07T19:43:05.1808176Z apicid : 46 2025-05-07T19:43:05.1808356Z initial apicid : 46 2025-05-07T19:43:05.1808550Z fpu : yes 2025-05-07T19:43:05.1808725Z fpu_exception : yes 2025-05-07T19:43:05.1808920Z cpuid level : 13 2025-05-07T19:43:05.1809100Z wp : yes 2025-05-07T19:43:05.1811242Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1813669Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1814208Z bogomips : 6000.01 2025-05-07T19:43:05.1814399Z clflush size : 64 2025-05-07T19:43:05.1814599Z cache_alignment : 64 2025-05-07T19:43:05.1814838Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1815135Z power management: 2025-05-07T19:43:05.1815251Z 2025-05-07T19:43:05.1815332Z processor : 24 2025-05-07T19:43:05.1815530Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1815750Z cpu family : 6 2025-05-07T19:43:05.1815937Z model : 85 2025-05-07T19:43:05.1816178Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1816491Z stepping : 7 2025-05-07T19:43:05.1816690Z microcode : 0x5003901 2025-05-07T19:43:05.1816884Z cpu MHz : 3276.339 2025-05-07T19:43:05.1817080Z cache size : 36608 KB 2025-05-07T19:43:05.1817275Z physical id : 1 2025-05-07T19:43:05.1817463Z siblings : 48 2025-05-07T19:43:05.1817637Z core id : 0 2025-05-07T19:43:05.1817817Z cpu cores : 24 2025-05-07T19:43:05.1817991Z apicid : 64 2025-05-07T19:43:05.1818178Z initial apicid : 64 2025-05-07T19:43:05.1818362Z fpu : yes 2025-05-07T19:43:05.1818546Z fpu_exception : yes 2025-05-07T19:43:05.1818732Z cpuid level : 13 2025-05-07T19:43:05.1818920Z wp : yes 2025-05-07T19:43:05.1821292Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1823995Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1824576Z bogomips : 6000.01 2025-05-07T19:43:05.1824834Z clflush size : 64 2025-05-07T19:43:05.1825037Z cache_alignment : 64 2025-05-07T19:43:05.1825299Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1825608Z power management: 2025-05-07T19:43:05.1825740Z 2025-05-07T19:43:05.1825818Z processor : 25 2025-05-07T19:43:05.1826022Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1826255Z cpu family : 6 2025-05-07T19:43:05.1826446Z model : 85 2025-05-07T19:43:05.1826715Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1827072Z stepping : 7 2025-05-07T19:43:05.1827287Z microcode : 0x5003901 2025-05-07T19:43:05.1827516Z cpu MHz : 3000.006 2025-05-07T19:43:05.1827733Z cache size : 36608 KB 2025-05-07T19:43:05.1827948Z physical id : 1 2025-05-07T19:43:05.1828168Z siblings : 48 2025-05-07T19:43:05.1828359Z core id : 1 2025-05-07T19:43:05.1828564Z cpu cores : 24 2025-05-07T19:43:05.1828756Z apicid : 66 2025-05-07T19:43:05.1828968Z initial apicid : 66 2025-05-07T19:43:05.1829183Z fpu : yes 2025-05-07T19:43:05.1829378Z fpu_exception : yes 2025-05-07T19:43:05.1829592Z cpuid level : 13 2025-05-07T19:43:05.1829789Z wp : yes 2025-05-07T19:43:05.1832099Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1834684Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1835225Z bogomips : 6000.01 2025-05-07T19:43:05.1835431Z clflush size : 64 2025-05-07T19:43:05.1835625Z cache_alignment : 64 2025-05-07T19:43:05.1835877Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1836165Z power management: 2025-05-07T19:43:05.1836293Z 2025-05-07T19:43:05.1836367Z processor : 26 2025-05-07T19:43:05.1836567Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1836779Z cpu family : 6 2025-05-07T19:43:05.1836974Z model : 85 2025-05-07T19:43:05.1837227Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1837569Z stepping : 7 2025-05-07T19:43:05.1837749Z microcode : 0x5003901 2025-05-07T19:43:05.1837969Z cpu MHz : 3000.006 2025-05-07T19:43:05.1838162Z cache size : 36608 KB 2025-05-07T19:43:05.1838388Z physical id : 1 2025-05-07T19:43:05.1838567Z siblings : 48 2025-05-07T19:43:05.1838769Z core id : 2 2025-05-07T19:43:05.1838944Z cpu cores : 24 2025-05-07T19:43:05.1839145Z apicid : 68 2025-05-07T19:43:05.1839325Z initial apicid : 68 2025-05-07T19:43:05.1839523Z fpu : yes 2025-05-07T19:43:05.1839716Z fpu_exception : yes 2025-05-07T19:43:05.1839906Z cpuid level : 13 2025-05-07T19:43:05.1840088Z wp : yes 2025-05-07T19:43:05.1842162Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1844626Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1845161Z bogomips : 6000.01 2025-05-07T19:43:05.1845355Z clflush size : 64 2025-05-07T19:43:05.1845548Z cache_alignment : 64 2025-05-07T19:43:05.1845789Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1846084Z power management: 2025-05-07T19:43:05.1846200Z 2025-05-07T19:43:05.1846272Z processor : 27 2025-05-07T19:43:05.1846465Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1846675Z cpu family : 6 2025-05-07T19:43:05.1846851Z model : 85 2025-05-07T19:43:05.1847092Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1847406Z stepping : 7 2025-05-07T19:43:05.1847588Z microcode : 0x5003901 2025-05-07T19:43:05.1847780Z cpu MHz : 3000.006 2025-05-07T19:43:05.1847975Z cache size : 36608 KB 2025-05-07T19:43:05.1848175Z physical id : 1 2025-05-07T19:43:05.1848368Z siblings : 48 2025-05-07T19:43:05.1848540Z core id : 3 2025-05-07T19:43:05.1848722Z cpu cores : 24 2025-05-07T19:43:05.1848895Z apicid : 70 2025-05-07T19:43:05.1849079Z initial apicid : 70 2025-05-07T19:43:05.1849273Z fpu : yes 2025-05-07T19:43:05.1849444Z fpu_exception : yes 2025-05-07T19:43:05.1849646Z cpuid level : 13 2025-05-07T19:43:05.1849823Z wp : yes 2025-05-07T19:43:05.1852113Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1854535Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1855068Z bogomips : 6000.01 2025-05-07T19:43:05.1855264Z clflush size : 64 2025-05-07T19:43:05.1855455Z cache_alignment : 64 2025-05-07T19:43:05.1855705Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1855991Z power management: 2025-05-07T19:43:05.1856117Z 2025-05-07T19:43:05.1856195Z processor : 28 2025-05-07T19:43:05.1856390Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1856615Z cpu family : 6 2025-05-07T19:43:05.1856800Z model : 85 2025-05-07T19:43:05.1857038Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1857360Z stepping : 7 2025-05-07T19:43:05.1857541Z microcode : 0x5003901 2025-05-07T19:43:05.1857744Z cpu MHz : 3212.252 2025-05-07T19:43:05.1857938Z cache size : 36608 KB 2025-05-07T19:43:05.1858137Z physical id : 1 2025-05-07T19:43:05.1858321Z siblings : 48 2025-05-07T19:43:05.1858503Z core id : 4 2025-05-07T19:43:05.1858673Z cpu cores : 24 2025-05-07T19:43:05.1858854Z apicid : 72 2025-05-07T19:43:05.1859031Z initial apicid : 72 2025-05-07T19:43:05.1859223Z fpu : yes 2025-05-07T19:43:05.1859400Z fpu_exception : yes 2025-05-07T19:43:05.1859591Z cpuid level : 13 2025-05-07T19:43:05.1859778Z wp : yes 2025-05-07T19:43:05.1862212Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1864892Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1865482Z bogomips : 6000.01 2025-05-07T19:43:05.1865696Z clflush size : 64 2025-05-07T19:43:05.1865927Z cache_alignment : 64 2025-05-07T19:43:05.1866195Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1866524Z power management: 2025-05-07T19:43:05.1866655Z 2025-05-07T19:43:05.1866739Z processor : 29 2025-05-07T19:43:05.1866958Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1867203Z cpu family : 6 2025-05-07T19:43:05.1867393Z model : 85 2025-05-07T19:43:05.1867669Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1868009Z stepping : 7 2025-05-07T19:43:05.1868217Z microcode : 0x5003901 2025-05-07T19:43:05.1868438Z cpu MHz : 3236.346 2025-05-07T19:43:05.1868647Z cache size : 36608 KB 2025-05-07T19:43:05.1868858Z physical id : 1 2025-05-07T19:43:05.1869069Z siblings : 48 2025-05-07T19:43:05.1869272Z core id : 5 2025-05-07T19:43:05.1869516Z cpu cores : 24 2025-05-07T19:43:05.1869734Z apicid : 74 2025-05-07T19:43:05.1869991Z initial apicid : 74 2025-05-07T19:43:05.1870252Z fpu : yes 2025-05-07T19:43:05.1870473Z fpu_exception : yes 2025-05-07T19:43:05.1870740Z cpuid level : 13 2025-05-07T19:43:05.1870964Z wp : yes 2025-05-07T19:43:05.1873424Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1876010Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1876769Z bogomips : 6000.01 2025-05-07T19:43:05.1877095Z clflush size : 64 2025-05-07T19:43:05.1877354Z cache_alignment : 64 2025-05-07T19:43:05.1877677Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1878023Z power management: 2025-05-07T19:43:05.1878189Z 2025-05-07T19:43:05.1878280Z processor : 30 2025-05-07T19:43:05.1878526Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1878778Z cpu family : 6 2025-05-07T19:43:05.1879010Z model : 85 2025-05-07T19:43:05.1879297Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1879680Z stepping : 7 2025-05-07T19:43:05.1879901Z microcode : 0x5003901 2025-05-07T19:43:05.1880160Z cpu MHz : 3000.006 2025-05-07T19:43:05.1880394Z cache size : 36608 KB 2025-05-07T19:43:05.1880652Z physical id : 1 2025-05-07T19:43:05.1880875Z siblings : 48 2025-05-07T19:43:05.1881111Z core id : 6 2025-05-07T19:43:05.1881325Z cpu cores : 24 2025-05-07T19:43:05.1881573Z apicid : 76 2025-05-07T19:43:05.1881794Z initial apicid : 76 2025-05-07T19:43:05.1882054Z fpu : yes 2025-05-07T19:43:05.1882291Z fpu_exception : yes 2025-05-07T19:43:05.1882522Z cpuid level : 13 2025-05-07T19:43:05.1882761Z wp : yes 2025-05-07T19:43:05.1885038Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1887698Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1888525Z bogomips : 6000.01 2025-05-07T19:43:05.1888856Z clflush size : 64 2025-05-07T19:43:05.1889091Z cache_alignment : 64 2025-05-07T19:43:05.1889366Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1889714Z power management: 2025-05-07T19:43:05.1889850Z 2025-05-07T19:43:05.1889941Z processor : 31 2025-05-07T19:43:05.1890194Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1890465Z cpu family : 6 2025-05-07T19:43:05.1890670Z model : 85 2025-05-07T19:43:05.1890952Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1891295Z stepping : 7 2025-05-07T19:43:05.1891523Z microcode : 0x5003901 2025-05-07T19:43:05.1891748Z cpu MHz : 3000.006 2025-05-07T19:43:05.1891982Z cache size : 36608 KB 2025-05-07T19:43:05.1892204Z physical id : 1 2025-05-07T19:43:05.1892437Z siblings : 48 2025-05-07T19:43:05.1892637Z core id : 7 2025-05-07T19:43:05.1892858Z cpu cores : 24 2025-05-07T19:43:05.1893063Z apicid : 78 2025-05-07T19:43:05.1893291Z initial apicid : 78 2025-05-07T19:43:05.1893529Z fpu : yes 2025-05-07T19:43:05.1893730Z fpu_exception : yes 2025-05-07T19:43:05.1893970Z cpuid level : 13 2025-05-07T19:43:05.1894177Z wp : yes 2025-05-07T19:43:05.1896369Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1898827Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1899388Z bogomips : 6000.01 2025-05-07T19:43:05.1899630Z clflush size : 64 2025-05-07T19:43:05.1899842Z cache_alignment : 64 2025-05-07T19:43:05.1900192Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1900506Z power management: 2025-05-07T19:43:05.1900840Z 2025-05-07T19:43:05.1900933Z processor : 32 2025-05-07T19:43:05.1901188Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1901500Z cpu family : 6 2025-05-07T19:43:05.1901744Z model : 85 2025-05-07T19:43:05.1902033Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1902425Z stepping : 7 2025-05-07T19:43:05.1902650Z microcode : 0x5003901 2025-05-07T19:43:05.1902912Z cpu MHz : 3000.006 2025-05-07T19:43:05.1903146Z cache size : 36608 KB 2025-05-07T19:43:05.1903408Z physical id : 1 2025-05-07T19:43:05.1903631Z siblings : 48 2025-05-07T19:43:05.1903880Z core id : 8 2025-05-07T19:43:05.1904100Z cpu cores : 24 2025-05-07T19:43:05.1904343Z apicid : 80 2025-05-07T19:43:05.1904565Z initial apicid : 80 2025-05-07T19:43:05.1904818Z fpu : yes 2025-05-07T19:43:05.1905053Z fpu_exception : yes 2025-05-07T19:43:05.1905282Z cpuid level : 13 2025-05-07T19:43:05.1905521Z wp : yes 2025-05-07T19:43:05.1907788Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1910465Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1911090Z bogomips : 6000.01 2025-05-07T19:43:05.1913011Z clflush size : 64 2025-05-07T19:43:05.1913211Z cache_alignment : 64 2025-05-07T19:43:05.1913452Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1913758Z power management: 2025-05-07T19:43:05.1913877Z 2025-05-07T19:43:05.1913951Z processor : 33 2025-05-07T19:43:05.1914151Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1914369Z cpu family : 6 2025-05-07T19:43:05.1914550Z model : 85 2025-05-07T19:43:05.1914805Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1915123Z stepping : 7 2025-05-07T19:43:05.1915320Z microcode : 0x5003901 2025-05-07T19:43:05.1915520Z cpu MHz : 3000.006 2025-05-07T19:43:05.1915719Z cache size : 36608 KB 2025-05-07T19:43:05.1915917Z physical id : 1 2025-05-07T19:43:05.1916109Z siblings : 48 2025-05-07T19:43:05.1916289Z core id : 9 2025-05-07T19:43:05.1916484Z cpu cores : 24 2025-05-07T19:43:05.1916666Z apicid : 82 2025-05-07T19:43:05.1916849Z initial apicid : 82 2025-05-07T19:43:05.1917049Z fpu : yes 2025-05-07T19:43:05.1917229Z fpu_exception : yes 2025-05-07T19:43:05.1917431Z cpuid level : 13 2025-05-07T19:43:05.1917611Z wp : yes 2025-05-07T19:43:05.1919693Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1922169Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1922695Z bogomips : 6000.01 2025-05-07T19:43:05.1922894Z clflush size : 64 2025-05-07T19:43:05.1923085Z cache_alignment : 64 2025-05-07T19:43:05.1923331Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1923618Z power management: 2025-05-07T19:43:05.1923745Z 2025-05-07T19:43:05.1923818Z processor : 34 2025-05-07T19:43:05.1924011Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1924216Z cpu family : 6 2025-05-07T19:43:05.1924396Z model : 85 2025-05-07T19:43:05.1924633Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1924950Z stepping : 7 2025-05-07T19:43:05.1925129Z microcode : 0x5003901 2025-05-07T19:43:05.1925336Z cpu MHz : 3000.006 2025-05-07T19:43:05.1925525Z cache size : 36608 KB 2025-05-07T19:43:05.1925729Z physical id : 1 2025-05-07T19:43:05.1925910Z siblings : 48 2025-05-07T19:43:05.1926092Z core id : 10 2025-05-07T19:43:05.1926267Z cpu cores : 24 2025-05-07T19:43:05.1926453Z apicid : 84 2025-05-07T19:43:05.1926633Z initial apicid : 84 2025-05-07T19:43:05.1926826Z fpu : yes 2025-05-07T19:43:05.1927006Z fpu_exception : yes 2025-05-07T19:43:05.1927198Z cpuid level : 13 2025-05-07T19:43:05.1927270Z wp : yes 2025-05-07T19:43:05.1929269Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1929624Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1929713Z bogomips : 6000.01 2025-05-07T19:43:05.1929788Z clflush size : 64 2025-05-07T19:43:05.1929867Z cache_alignment : 64 2025-05-07T19:43:05.1930037Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1930125Z power management: 2025-05-07T19:43:05.1930130Z 2025-05-07T19:43:05.1930202Z processor : 35 2025-05-07T19:43:05.1930281Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1930363Z cpu family : 6 2025-05-07T19:43:05.1930434Z model : 85 2025-05-07T19:43:05.1930579Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1930653Z stepping : 7 2025-05-07T19:43:05.1930743Z microcode : 0x5003901 2025-05-07T19:43:05.1930816Z cpu MHz : 3000.006 2025-05-07T19:43:05.1930891Z cache size : 36608 KB 2025-05-07T19:43:05.1930973Z physical id : 1 2025-05-07T19:43:05.1931044Z siblings : 48 2025-05-07T19:43:05.1931113Z core id : 11 2025-05-07T19:43:05.1931185Z cpu cores : 24 2025-05-07T19:43:05.1931261Z apicid : 86 2025-05-07T19:43:05.1931340Z initial apicid : 86 2025-05-07T19:43:05.1931409Z fpu : yes 2025-05-07T19:43:05.1931495Z fpu_exception : yes 2025-05-07T19:43:05.1931570Z cpuid level : 13 2025-05-07T19:43:05.1931645Z wp : yes 2025-05-07T19:43:05.1933630Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1933993Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1934068Z bogomips : 6000.01 2025-05-07T19:43:05.1934204Z clflush size : 64 2025-05-07T19:43:05.1934279Z cache_alignment : 64 2025-05-07T19:43:05.1934397Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1934477Z power management: 2025-05-07T19:43:05.1934481Z 2025-05-07T19:43:05.1934561Z processor : 36 2025-05-07T19:43:05.1934643Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1934715Z cpu family : 6 2025-05-07T19:43:05.1934794Z model : 85 2025-05-07T19:43:05.1934943Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1935017Z stepping : 7 2025-05-07T19:43:05.1935092Z microcode : 0x5003901 2025-05-07T19:43:05.1935175Z cpu MHz : 3259.388 2025-05-07T19:43:05.1935253Z cache size : 36608 KB 2025-05-07T19:43:05.1935325Z physical id : 1 2025-05-07T19:43:05.1935404Z siblings : 48 2025-05-07T19:43:05.1935478Z core id : 12 2025-05-07T19:43:05.1935552Z cpu cores : 24 2025-05-07T19:43:05.1935621Z apicid : 88 2025-05-07T19:43:05.1935706Z initial apicid : 88 2025-05-07T19:43:05.1935776Z fpu : yes 2025-05-07T19:43:05.1935858Z fpu_exception : yes 2025-05-07T19:43:05.1935931Z cpuid level : 13 2025-05-07T19:43:05.1936009Z wp : yes 2025-05-07T19:43:05.1937989Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1938354Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1938433Z bogomips : 6000.01 2025-05-07T19:43:05.1938507Z clflush size : 64 2025-05-07T19:43:05.1938598Z cache_alignment : 64 2025-05-07T19:43:05.1938714Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1938792Z power management: 2025-05-07T19:43:05.1938844Z 2025-05-07T19:43:05.1938918Z processor : 37 2025-05-07T19:43:05.1939007Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1939079Z cpu family : 6 2025-05-07T19:43:05.1939149Z model : 85 2025-05-07T19:43:05.1939300Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1939372Z stepping : 7 2025-05-07T19:43:05.1939448Z microcode : 0x5003901 2025-05-07T19:43:05.1939522Z cpu MHz : 3000.006 2025-05-07T19:43:05.1939601Z cache size : 36608 KB 2025-05-07T19:43:05.1939673Z physical id : 1 2025-05-07T19:43:05.1939741Z siblings : 48 2025-05-07T19:43:05.1939819Z core id : 13 2025-05-07T19:43:05.1939888Z cpu cores : 24 2025-05-07T19:43:05.1939955Z apicid : 90 2025-05-07T19:43:05.1940028Z initial apicid : 90 2025-05-07T19:43:05.1940185Z fpu : yes 2025-05-07T19:43:05.1940265Z fpu_exception : yes 2025-05-07T19:43:05.1940340Z cpuid level : 13 2025-05-07T19:43:05.1940410Z wp : yes 2025-05-07T19:43:05.1942716Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1943102Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1943193Z bogomips : 6000.01 2025-05-07T19:43:05.1943272Z clflush size : 64 2025-05-07T19:43:05.1943355Z cache_alignment : 64 2025-05-07T19:43:05.1943534Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1943627Z power management: 2025-05-07T19:43:05.1943632Z 2025-05-07T19:43:05.1943713Z processor : 38 2025-05-07T19:43:05.1943798Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1943886Z cpu family : 6 2025-05-07T19:43:05.1943960Z model : 85 2025-05-07T19:43:05.1944113Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1944214Z stepping : 7 2025-05-07T19:43:05.1944297Z microcode : 0x5003901 2025-05-07T19:43:05.1944376Z cpu MHz : 3000.006 2025-05-07T19:43:05.1944463Z cache size : 36608 KB 2025-05-07T19:43:05.1944567Z physical id : 1 2025-05-07T19:43:05.1944646Z siblings : 48 2025-05-07T19:43:05.1944728Z core id : 14 2025-05-07T19:43:05.1944809Z cpu cores : 24 2025-05-07T19:43:05.1944907Z apicid : 92 2025-05-07T19:43:05.1944993Z initial apicid : 92 2025-05-07T19:43:05.1945074Z fpu : yes 2025-05-07T19:43:05.1945177Z fpu_exception : yes 2025-05-07T19:43:05.1945262Z cpuid level : 13 2025-05-07T19:43:05.1945344Z wp : yes 2025-05-07T19:43:05.1947505Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1947899Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1947982Z bogomips : 6000.01 2025-05-07T19:43:05.1948071Z clflush size : 64 2025-05-07T19:43:05.1948154Z cache_alignment : 64 2025-05-07T19:43:05.1948285Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1948368Z power management: 2025-05-07T19:43:05.1948385Z 2025-05-07T19:43:05.1948468Z processor : 39 2025-05-07T19:43:05.1948623Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1948700Z cpu family : 6 2025-05-07T19:43:05.1948779Z model : 85 2025-05-07T19:43:05.1948935Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1949018Z stepping : 7 2025-05-07T19:43:05.1949099Z microcode : 0x5003901 2025-05-07T19:43:05.1949189Z cpu MHz : 3000.006 2025-05-07T19:43:05.1949269Z cache size : 36608 KB 2025-05-07T19:43:05.1949349Z physical id : 1 2025-05-07T19:43:05.1949434Z siblings : 48 2025-05-07T19:43:05.1949509Z core id : 15 2025-05-07T19:43:05.1949583Z cpu cores : 24 2025-05-07T19:43:05.1949668Z apicid : 94 2025-05-07T19:43:05.1949757Z initial apicid : 94 2025-05-07T19:43:05.1949835Z fpu : yes 2025-05-07T19:43:05.1949919Z fpu_exception : yes 2025-05-07T19:43:05.1950007Z cpuid level : 13 2025-05-07T19:43:05.1950079Z wp : yes 2025-05-07T19:43:05.1952448Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1952966Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1953052Z bogomips : 6000.01 2025-05-07T19:43:05.1953128Z clflush size : 64 2025-05-07T19:43:05.1953220Z cache_alignment : 64 2025-05-07T19:43:05.1953339Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1953481Z power management: 2025-05-07T19:43:05.1953485Z 2025-05-07T19:43:05.1953558Z processor : 40 2025-05-07T19:43:05.1953646Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1953721Z cpu family : 6 2025-05-07T19:43:05.1953798Z model : 85 2025-05-07T19:43:05.1953953Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1954026Z stepping : 7 2025-05-07T19:43:05.1954103Z microcode : 0x5003901 2025-05-07T19:43:05.1954178Z cpu MHz : 3000.006 2025-05-07T19:43:05.1954259Z cache size : 36608 KB 2025-05-07T19:43:05.1954331Z physical id : 1 2025-05-07T19:43:05.1954404Z siblings : 48 2025-05-07T19:43:05.1954480Z core id : 16 2025-05-07T19:43:05.1954551Z cpu cores : 24 2025-05-07T19:43:05.1954624Z apicid : 96 2025-05-07T19:43:05.1954703Z initial apicid : 96 2025-05-07T19:43:05.1954785Z fpu : yes 2025-05-07T19:43:05.1954860Z fpu_exception : yes 2025-05-07T19:43:05.1954933Z cpuid level : 13 2025-05-07T19:43:05.1955013Z wp : yes 2025-05-07T19:43:05.1956999Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1957355Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1957441Z bogomips : 6000.01 2025-05-07T19:43:05.1957515Z clflush size : 64 2025-05-07T19:43:05.1957590Z cache_alignment : 64 2025-05-07T19:43:05.1957717Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1957795Z power management: 2025-05-07T19:43:05.1957800Z 2025-05-07T19:43:05.1957876Z processor : 41 2025-05-07T19:43:05.1957959Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1958046Z cpu family : 6 2025-05-07T19:43:05.1958118Z model : 85 2025-05-07T19:43:05.1958316Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1958405Z stepping : 7 2025-05-07T19:43:05.1958487Z microcode : 0x5003901 2025-05-07T19:43:05.1958558Z cpu MHz : 3000.006 2025-05-07T19:43:05.1958635Z cache size : 36608 KB 2025-05-07T19:43:05.1958731Z physical id : 1 2025-05-07T19:43:05.1958805Z siblings : 48 2025-05-07T19:43:05.1958874Z core id : 17 2025-05-07T19:43:05.1958965Z cpu cores : 24 2025-05-07T19:43:05.1959044Z apicid : 98 2025-05-07T19:43:05.1959129Z initial apicid : 98 2025-05-07T19:43:05.1959197Z fpu : yes 2025-05-07T19:43:05.1959285Z fpu_exception : yes 2025-05-07T19:43:05.1959359Z cpuid level : 13 2025-05-07T19:43:05.1959434Z wp : yes 2025-05-07T19:43:05.1961421Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1961781Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1961856Z bogomips : 6000.01 2025-05-07T19:43:05.1961951Z clflush size : 64 2025-05-07T19:43:05.1962036Z cache_alignment : 64 2025-05-07T19:43:05.1962155Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1962246Z power management: 2025-05-07T19:43:05.1962251Z 2025-05-07T19:43:05.1962328Z processor : 42 2025-05-07T19:43:05.1962462Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1962535Z cpu family : 6 2025-05-07T19:43:05.1962622Z model : 85 2025-05-07T19:43:05.1962774Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1962853Z stepping : 7 2025-05-07T19:43:05.1962944Z microcode : 0x5003901 2025-05-07T19:43:05.1963024Z cpu MHz : 3211.431 2025-05-07T19:43:05.1963100Z cache size : 36608 KB 2025-05-07T19:43:05.1963174Z physical id : 1 2025-05-07T19:43:05.1963263Z siblings : 48 2025-05-07T19:43:05.1963341Z core id : 18 2025-05-07T19:43:05.1963420Z cpu cores : 24 2025-05-07T19:43:05.1963507Z apicid : 100 2025-05-07T19:43:05.1963594Z initial apicid : 100 2025-05-07T19:43:05.1963672Z fpu : yes 2025-05-07T19:43:05.1963759Z fpu_exception : yes 2025-05-07T19:43:05.1963854Z cpuid level : 13 2025-05-07T19:43:05.1963926Z wp : yes 2025-05-07T19:43:05.1965908Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.1966285Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1966367Z bogomips : 6000.01 2025-05-07T19:43:05.1966445Z clflush size : 64 2025-05-07T19:43:05.1966538Z cache_alignment : 64 2025-05-07T19:43:05.1966660Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1966742Z power management: 2025-05-07T19:43:05.1966746Z 2025-05-07T19:43:05.1966833Z processor : 43 2025-05-07T19:43:05.1966914Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1966994Z cpu family : 6 2025-05-07T19:43:05.1967070Z model : 85 2025-05-07T19:43:05.1967230Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1967357Z stepping : 7 2025-05-07T19:43:05.1967432Z microcode : 0x5003901 2025-05-07T19:43:05.1967520Z cpu MHz : 3000.006 2025-05-07T19:43:05.1967598Z cache size : 36608 KB 2025-05-07T19:43:05.1967674Z physical id : 1 2025-05-07T19:43:05.1967744Z siblings : 48 2025-05-07T19:43:05.1967827Z core id : 19 2025-05-07T19:43:05.1967902Z cpu cores : 24 2025-05-07T19:43:05.1967974Z apicid : 102 2025-05-07T19:43:05.1968051Z initial apicid : 102 2025-05-07T19:43:05.1968134Z fpu : yes 2025-05-07T19:43:05.1968538Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:05.1968617Z fpu_exception : yes 2025-05-07T19:43:05.1968705Z cpuid level : 13 2025-05-07T19:43:05.1968776Z wp : yes 2025-05-07T19:43:05.1970766Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw 
avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.1971143Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1971221Z bogomips : 6000.01 2025-05-07T19:43:05.1971299Z clflush size : 64 2025-05-07T19:43:05.1971393Z cache_alignment : 64 2025-05-07T19:43:05.1971515Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1971592Z power management: 2025-05-07T19:43:05.1971596Z 2025-05-07T19:43:05.1971683Z processor : 44 2025-05-07T19:43:05.1971814Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1971888Z cpu family : 6 2025-05-07T19:43:05.1971963Z model : 85 2025-05-07T19:43:05.1972129Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1972216Z stepping : 7 2025-05-07T19:43:05.1972292Z microcode : 0x5003901 2025-05-07T19:43:05.1972388Z cpu MHz : 3000.006 2025-05-07T19:43:05.1972470Z cache size : 36608 KB 2025-05-07T19:43:05.1972554Z physical id : 1 2025-05-07T19:43:05.1972627Z siblings : 48 2025-05-07T19:43:05.1972720Z core id : 20 2025-05-07T19:43:05.1972799Z cpu cores : 24 2025-05-07T19:43:05.1972871Z apicid : 104 2025-05-07T19:43:05.1972970Z initial apicid : 104 2025-05-07T19:43:05.1973042Z fpu : yes 2025-05-07T19:43:05.1973125Z fpu_exception : yes 2025-05-07T19:43:05.1973203Z cpuid level : 13 2025-05-07T19:43:05.1973291Z wp : yes 2025-05-07T19:43:05.1975276Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl 
xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.1975654Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1975730Z bogomips : 6000.01 2025-05-07T19:43:05.1975806Z clflush size : 64 2025-05-07T19:43:05.1976009Z cache_alignment : 64 2025-05-07T19:43:05.1976141Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1976392Z power management: 2025-05-07T19:43:05.1976397Z 2025-05-07T19:43:05.1976484Z processor : 45 2025-05-07T19:43:05.1976603Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1976693Z cpu family : 6 2025-05-07T19:43:05.1976767Z model : 85 2025-05-07T19:43:05.1977048Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1977245Z stepping : 7 2025-05-07T19:43:05.1977326Z microcode : 0x5003901 2025-05-07T19:43:05.1977421Z cpu MHz : 3000.006 2025-05-07T19:43:05.1977520Z cache size : 36608 KB 2025-05-07T19:43:05.1977608Z physical id : 1 2025-05-07T19:43:05.1977685Z siblings : 48 2025-05-07T19:43:05.1977761Z core id : 21 2025-05-07T19:43:05.1977856Z cpu cores : 24 2025-05-07T19:43:05.1977940Z apicid : 106 2025-05-07T19:43:05.1978024Z initial apicid : 106 2025-05-07T19:43:05.1978111Z fpu : yes 2025-05-07T19:43:05.1978200Z fpu_exception : yes 2025-05-07T19:43:05.1978277Z cpuid level : 13 2025-05-07T19:43:05.1978358Z wp : yes 2025-05-07T19:43:05.1980572Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt 
xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.1980966Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1981066Z bogomips : 6000.01 2025-05-07T19:43:05.1981151Z clflush size : 64 2025-05-07T19:43:05.1981243Z cache_alignment : 64 2025-05-07T19:43:05.1981370Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1981476Z power management: 2025-05-07T19:43:05.1981481Z 2025-05-07T19:43:05.1981566Z processor : 46 2025-05-07T19:43:05.1981658Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1981758Z cpu family : 6 2025-05-07T19:43:05.1981834Z model : 85 2025-05-07T19:43:05.1982060Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1982144Z stepping : 7 2025-05-07T19:43:05.1982238Z microcode : 0x5003901 2025-05-07T19:43:05.1982322Z cpu MHz : 3000.006 2025-05-07T19:43:05.1982402Z cache size : 36608 KB 2025-05-07T19:43:05.1982493Z physical id : 1 2025-05-07T19:43:05.1982569Z siblings : 48 2025-05-07T19:43:05.1982652Z core id : 22 2025-05-07T19:43:05.1982728Z cpu cores : 24 2025-05-07T19:43:05.1982811Z apicid : 108 2025-05-07T19:43:05.1982895Z initial apicid : 108 2025-05-07T19:43:05.1982968Z fpu : yes 2025-05-07T19:43:05.1983051Z fpu_exception : yes 2025-05-07T19:43:05.1983138Z cpuid level : 13 2025-05-07T19:43:05.1983211Z wp : yes 2025-05-07T19:43:05.1985352Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec 
xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.1985748Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1985828Z bogomips : 6000.01 2025-05-07T19:43:05.1985921Z clflush size : 64 2025-05-07T19:43:05.1986000Z cache_alignment : 64 2025-05-07T19:43:05.1986124Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1986210Z power management: 2025-05-07T19:43:05.1986215Z 2025-05-07T19:43:05.1986302Z processor : 47 2025-05-07T19:43:05.1986389Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1986468Z cpu family : 6 2025-05-07T19:43:05.1986552Z model : 85 2025-05-07T19:43:05.1986711Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1986791Z stepping : 7 2025-05-07T19:43:05.1986873Z microcode : 0x5003901 2025-05-07T19:43:05.1986964Z cpu MHz : 3000.006 2025-05-07T19:43:05.1987099Z cache size : 36608 KB 2025-05-07T19:43:05.1987180Z physical id : 1 2025-05-07T19:43:05.1987265Z siblings : 48 2025-05-07T19:43:05.1987340Z core id : 23 2025-05-07T19:43:05.1987418Z cpu cores : 24 2025-05-07T19:43:05.1987495Z apicid : 110 2025-05-07T19:43:05.1987586Z initial apicid : 110 2025-05-07T19:43:05.1987658Z fpu : yes 2025-05-07T19:43:05.1987741Z fpu_exception : yes 2025-05-07T19:43:05.1987828Z cpuid level : 13 2025-05-07T19:43:05.1987910Z wp : yes 2025-05-07T19:43:05.1990059Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 
xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:43:05.1990446Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1990524Z bogomips : 6000.01 2025-05-07T19:43:05.1990603Z clflush size : 64 2025-05-07T19:43:05.1990685Z cache_alignment : 64 2025-05-07T19:43:05.1990814Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1990897Z power management: 2025-05-07T19:43:05.1990902Z 2025-05-07T19:43:05.1990978Z processor : 48 2025-05-07T19:43:05.1991072Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1991149Z cpu family : 6 2025-05-07T19:43:05.1991224Z model : 85 2025-05-07T19:43:05.1991388Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1991524Z stepping : 7 2025-05-07T19:43:05.1991613Z microcode : 0x5003901 2025-05-07T19:43:05.1991690Z cpu MHz : 3000.006 2025-05-07T19:43:05.1991783Z cache size : 36608 KB 2025-05-07T19:43:05.1991869Z physical id : 0 2025-05-07T19:43:05.1991949Z siblings : 48 2025-05-07T19:43:05.1992022Z core id : 0 2025-05-07T19:43:05.1992100Z cpu cores : 24 2025-05-07T19:43:05.1992175Z apicid : 1 2025-05-07T19:43:05.1992254Z initial apicid : 1 2025-05-07T19:43:05.1992445Z fpu : yes 2025-05-07T19:43:05.1992523Z fpu_exception : yes 2025-05-07T19:43:05.1992596Z cpuid level : 13 2025-05-07T19:43:05.1992666Z wp : yes 2025-05-07T19:43:05.1994651Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat 
pku ospke avx512_vnni 2025-05-07T19:43:05.1995009Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1995097Z bogomips : 6000.01 2025-05-07T19:43:05.1995171Z clflush size : 64 2025-05-07T19:43:05.1995248Z cache_alignment : 64 2025-05-07T19:43:05.1995362Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1995446Z power management: 2025-05-07T19:43:05.1995450Z 2025-05-07T19:43:05.1995525Z processor : 49 2025-05-07T19:43:05.1995604Z vendor_id : GenuineIntel 2025-05-07T19:43:05.1995679Z cpu family : 6 2025-05-07T19:43:05.1995751Z model : 85 2025-05-07T19:43:05.1995898Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.1995974Z stepping : 7 2025-05-07T19:43:05.1996057Z microcode : 0x5003901 2025-05-07T19:43:05.1996128Z cpu MHz : 3000.006 2025-05-07T19:43:05.1996201Z cache size : 36608 KB 2025-05-07T19:43:05.1996278Z physical id : 0 2025-05-07T19:43:05.1996395Z siblings : 48 2025-05-07T19:43:05.1996462Z core id : 1 2025-05-07T19:43:05.1996531Z cpu cores : 24 2025-05-07T19:43:05.1996605Z apicid : 3 2025-05-07T19:43:05.1996683Z initial apicid : 3 2025-05-07T19:43:05.1996753Z fpu : yes 2025-05-07T19:43:05.1996835Z fpu_exception : yes 2025-05-07T19:43:05.1996908Z cpuid level : 13 2025-05-07T19:43:05.1996979Z wp : yes 2025-05-07T19:43:05.1998971Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke 
avx512_vnni 2025-05-07T19:43:05.1999327Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.1999403Z bogomips : 6000.01 2025-05-07T19:43:05.1999495Z clflush size : 64 2025-05-07T19:43:05.1999573Z cache_alignment : 64 2025-05-07T19:43:05.1999692Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.1999771Z power management: 2025-05-07T19:43:05.1999775Z 2025-05-07T19:43:05.1999859Z processor : 50 2025-05-07T19:43:05.1999943Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2000017Z cpu family : 6 2025-05-07T19:43:05.2000099Z model : 85 2025-05-07T19:43:05.2000241Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2000312Z stepping : 7 2025-05-07T19:43:05.2000389Z microcode : 0x5003901 2025-05-07T19:43:05.2000521Z cpu MHz : 3000.006 2025-05-07T19:43:05.2000596Z cache size : 36608 KB 2025-05-07T19:43:05.2000670Z physical id : 0 2025-05-07T19:43:05.2000750Z siblings : 48 2025-05-07T19:43:05.2000826Z core id : 2 2025-05-07T19:43:05.2000898Z cpu cores : 24 2025-05-07T19:43:05.2000967Z apicid : 5 2025-05-07T19:43:05.2001054Z initial apicid : 5 2025-05-07T19:43:05.2001123Z fpu : yes 2025-05-07T19:43:05.2001198Z fpu_exception : yes 2025-05-07T19:43:05.2001282Z cpuid level : 13 2025-05-07T19:43:05.2001351Z wp : yes 2025-05-07T19:43:05.2003589Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2003974Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2004062Z bogomips : 6000.01 2025-05-07T19:43:05.2004140Z clflush size : 64 2025-05-07T19:43:05.2004230Z cache_alignment : 64 2025-05-07T19:43:05.2004352Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2004447Z power management: 2025-05-07T19:43:05.2004452Z 2025-05-07T19:43:05.2004530Z processor : 51 2025-05-07T19:43:05.2004627Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2004705Z cpu family : 6 2025-05-07T19:43:05.2004780Z model : 85 2025-05-07T19:43:05.2004944Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2005021Z stepping : 7 2025-05-07T19:43:05.2005103Z microcode : 0x5003901 2025-05-07T19:43:05.2005178Z cpu MHz : 3000.006 2025-05-07T19:43:05.2005272Z cache size : 36608 KB 2025-05-07T19:43:05.2005351Z physical id : 0 2025-05-07T19:43:05.2005426Z siblings : 48 2025-05-07T19:43:05.2005508Z core id : 3 2025-05-07T19:43:05.2005636Z cpu cores : 24 2025-05-07T19:43:05.2005709Z apicid : 7 2025-05-07T19:43:05.2005789Z initial apicid : 7 2025-05-07T19:43:05.2005874Z fpu : yes 2025-05-07T19:43:05.2005952Z fpu_exception : yes 2025-05-07T19:43:05.2006029Z cpuid level : 13 2025-05-07T19:43:05.2006111Z wp : yes 2025-05-07T19:43:05.2008349Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2008794Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2008894Z bogomips : 6000.01 2025-05-07T19:43:05.2008971Z clflush size : 64 2025-05-07T19:43:05.2009055Z cache_alignment : 64 2025-05-07T19:43:05.2009192Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2009268Z power management: 2025-05-07T19:43:05.2009272Z 2025-05-07T19:43:05.2009354Z processor : 52 2025-05-07T19:43:05.2009446Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2009535Z cpu family : 6 2025-05-07T19:43:05.2009609Z model : 85 2025-05-07T19:43:05.2009760Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2009845Z stepping : 7 2025-05-07T19:43:05.2009926Z microcode : 0x5003901 2025-05-07T19:43:05.2010001Z cpu MHz : 3000.006 2025-05-07T19:43:05.2010079Z cache size : 36608 KB 2025-05-07T19:43:05.2010221Z physical id : 0 2025-05-07T19:43:05.2010296Z siblings : 48 2025-05-07T19:43:05.2010372Z core id : 4 2025-05-07T19:43:05.2010572Z cpu cores : 24 2025-05-07T19:43:05.2010781Z apicid : 9 2025-05-07T19:43:05.2010887Z initial apicid : 9 2025-05-07T19:43:05.2010966Z fpu : yes 2025-05-07T19:43:05.2011055Z fpu_exception : yes 2025-05-07T19:43:05.2011134Z cpuid level : 13 2025-05-07T19:43:05.2011374Z wp : yes 2025-05-07T19:43:05.2013637Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2014015Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2014098Z bogomips : 6000.01 2025-05-07T19:43:05.2014187Z clflush size : 64 2025-05-07T19:43:05.2014267Z cache_alignment : 64 2025-05-07T19:43:05.2014390Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2014478Z power management: 2025-05-07T19:43:05.2014482Z 2025-05-07T19:43:05.2014562Z processor : 53 2025-05-07T19:43:05.2014646Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2014723Z cpu family : 6 2025-05-07T19:43:05.2014804Z model : 85 2025-05-07T19:43:05.2014959Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2015034Z stepping : 7 2025-05-07T19:43:05.2015123Z microcode : 0x5003901 2025-05-07T19:43:05.2015195Z cpu MHz : 3000.006 2025-05-07T19:43:05.2015271Z cache size : 36608 KB 2025-05-07T19:43:05.2015350Z physical id : 0 2025-05-07T19:43:05.2015436Z siblings : 48 2025-05-07T19:43:05.2015511Z core id : 5 2025-05-07T19:43:05.2015587Z cpu cores : 24 2025-05-07T19:43:05.2015668Z apicid : 11 2025-05-07T19:43:05.2015747Z initial apicid : 11 2025-05-07T19:43:05.2015891Z fpu : yes 2025-05-07T19:43:05.2015971Z fpu_exception : yes 2025-05-07T19:43:05.2016060Z cpuid level : 13 2025-05-07T19:43:05.2016132Z wp : yes 2025-05-07T19:43:05.2018219Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2018585Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2018663Z bogomips : 6000.01 2025-05-07T19:43:05.2018743Z clflush size : 64 2025-05-07T19:43:05.2018833Z cache_alignment : 64 2025-05-07T19:43:05.2018954Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2019030Z power management: 2025-05-07T19:43:05.2019034Z 2025-05-07T19:43:05.2019120Z processor : 54 2025-05-07T19:43:05.2019204Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2019280Z cpu family : 6 2025-05-07T19:43:05.2019348Z model : 85 2025-05-07T19:43:05.2019505Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2019578Z stepping : 7 2025-05-07T19:43:05.2019655Z microcode : 0x5003901 2025-05-07T19:43:05.2019736Z cpu MHz : 3000.006 2025-05-07T19:43:05.2019815Z cache size : 36608 KB 2025-05-07T19:43:05.2019891Z physical id : 0 2025-05-07T19:43:05.2019963Z siblings : 48 2025-05-07T19:43:05.2020042Z core id : 6 2025-05-07T19:43:05.2020230Z cpu cores : 24 2025-05-07T19:43:05.2020306Z apicid : 13 2025-05-07T19:43:05.2020383Z initial apicid : 13 2025-05-07T19:43:05.2020464Z fpu : yes 2025-05-07T19:43:05.2020718Z fpu_exception : yes 2025-05-07T19:43:05.2020798Z cpuid level : 13 2025-05-07T19:43:05.2020880Z wp : yes 2025-05-07T19:43:05.2023020Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2023409Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2023499Z bogomips : 6000.01 2025-05-07T19:43:05.2023577Z clflush size : 64 2025-05-07T19:43:05.2023664Z cache_alignment : 64 2025-05-07T19:43:05.2023798Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2023882Z power management: 2025-05-07T19:43:05.2023886Z 2025-05-07T19:43:05.2023965Z processor : 55 2025-05-07T19:43:05.2024064Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2024142Z cpu family : 6 2025-05-07T19:43:05.2024218Z model : 85 2025-05-07T19:43:05.2024377Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2024467Z stepping : 7 2025-05-07T19:43:05.2024551Z microcode : 0x5003901 2025-05-07T19:43:05.2024631Z cpu MHz : 3000.006 2025-05-07T19:43:05.2024714Z cache size : 36608 KB 2025-05-07T19:43:05.2024804Z physical id : 0 2025-05-07T19:43:05.2024881Z siblings : 48 2025-05-07T19:43:05.2024958Z core id : 7 2025-05-07T19:43:05.2025041Z cpu cores : 24 2025-05-07T19:43:05.2025117Z apicid : 15 2025-05-07T19:43:05.2025200Z initial apicid : 15 2025-05-07T19:43:05.2025278Z fpu : yes 2025-05-07T19:43:05.2025365Z fpu_exception : yes 2025-05-07T19:43:05.2025511Z cpuid level : 13 2025-05-07T19:43:05.2025587Z wp : yes 2025-05-07T19:43:05.2027741Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2028128Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2028209Z bogomips : 6000.01 2025-05-07T19:43:05.2028298Z clflush size : 64 2025-05-07T19:43:05.2028378Z cache_alignment : 64 2025-05-07T19:43:05.2028510Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2028603Z power management: 2025-05-07T19:43:05.2028608Z 2025-05-07T19:43:05.2028682Z processor : 56 2025-05-07T19:43:05.2028764Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2028837Z cpu family : 6 2025-05-07T19:43:05.2028923Z model : 85 2025-05-07T19:43:05.2029079Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2029154Z stepping : 7 2025-05-07T19:43:05.2029250Z microcode : 0x5003901 2025-05-07T19:43:05.2029328Z cpu MHz : 3000.006 2025-05-07T19:43:05.2029405Z cache size : 36608 KB 2025-05-07T19:43:05.2029485Z physical id : 0 2025-05-07T19:43:05.2029568Z siblings : 48 2025-05-07T19:43:05.2029645Z core id : 8 2025-05-07T19:43:05.2029720Z cpu cores : 24 2025-05-07T19:43:05.2029802Z apicid : 17 2025-05-07T19:43:05.2029931Z initial apicid : 17 2025-05-07T19:43:05.2030007Z fpu : yes 2025-05-07T19:43:05.2030087Z fpu_exception : yes 2025-05-07T19:43:05.2030175Z cpuid level : 13 2025-05-07T19:43:05.2030258Z wp : yes 2025-05-07T19:43:05.2032395Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2032894Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2032970Z bogomips : 6000.01 2025-05-07T19:43:05.2033047Z clflush size : 64 2025-05-07T19:43:05.2033129Z cache_alignment : 64 2025-05-07T19:43:05.2033246Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2033327Z power management: 2025-05-07T19:43:05.2033332Z 2025-05-07T19:43:05.2033410Z processor : 57 2025-05-07T19:43:05.2033490Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2033563Z cpu family : 6 2025-05-07T19:43:05.2033634Z model : 85 2025-05-07T19:43:05.2033787Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2033859Z stepping : 7 2025-05-07T19:43:05.2033933Z microcode : 0x5003901 2025-05-07T19:43:05.2034019Z cpu MHz : 3000.006 2025-05-07T19:43:05.2034098Z cache size : 36608 KB 2025-05-07T19:43:05.2034173Z physical id : 0 2025-05-07T19:43:05.2034243Z siblings : 48 2025-05-07T19:43:05.2034317Z core id : 9 2025-05-07T19:43:05.2034390Z cpu cores : 24 2025-05-07T19:43:05.2034458Z apicid : 19 2025-05-07T19:43:05.2034541Z initial apicid : 19 2025-05-07T19:43:05.2034610Z fpu : yes 2025-05-07T19:43:05.2034690Z fpu_exception : yes 2025-05-07T19:43:05.2034762Z cpuid level : 13 2025-05-07T19:43:05.2034843Z wp : yes 2025-05-07T19:43:05.2036821Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2037227Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2037302Z bogomips : 6000.01 2025-05-07T19:43:05.2037377Z clflush size : 64 2025-05-07T19:43:05.2037459Z cache_alignment : 64 2025-05-07T19:43:05.2037581Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2037655Z power management: 2025-05-07T19:43:05.2037663Z 2025-05-07T19:43:05.2037737Z processor : 58 2025-05-07T19:43:05.2037828Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2037898Z cpu family : 6 2025-05-07T19:43:05.2037969Z model : 85 2025-05-07T19:43:05.2038116Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2038199Z stepping : 7 2025-05-07T19:43:05.2038273Z microcode : 0x5003901 2025-05-07T19:43:05.2038349Z cpu MHz : 1200.168 2025-05-07T19:43:05.2038437Z cache size : 36608 KB 2025-05-07T19:43:05.2038511Z physical id : 0 2025-05-07T19:43:05.2038581Z siblings : 48 2025-05-07T19:43:05.2038653Z core id : 10 2025-05-07T19:43:05.2038735Z cpu cores : 24 2025-05-07T19:43:05.2038803Z apicid : 21 2025-05-07T19:43:05.2038874Z initial apicid : 21 2025-05-07T19:43:05.2038954Z fpu : yes 2025-05-07T19:43:05.2039034Z fpu_exception : yes 2025-05-07T19:43:05.2039161Z cpuid level : 13 2025-05-07T19:43:05.2039231Z wp : yes 2025-05-07T19:43:05.2041213Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2041569Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2041653Z bogomips : 6000.01 2025-05-07T19:43:05.2041730Z clflush size : 64 2025-05-07T19:43:05.2041808Z cache_alignment : 64 2025-05-07T19:43:05.2041930Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2042016Z power management: 2025-05-07T19:43:05.2042020Z 2025-05-07T19:43:05.2042095Z processor : 59 2025-05-07T19:43:05.2042175Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2042257Z cpu family : 6 2025-05-07T19:43:05.2042328Z model : 85 2025-05-07T19:43:05.2042472Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2042542Z stepping : 7 2025-05-07T19:43:05.2042629Z microcode : 0x5003901 2025-05-07T19:43:05.2042701Z cpu MHz : 3000.006 2025-05-07T19:43:05.2042774Z cache size : 36608 KB 2025-05-07T19:43:05.2042861Z physical id : 0 2025-05-07T19:43:05.2042932Z siblings : 48 2025-05-07T19:43:05.2043001Z core id : 11 2025-05-07T19:43:05.2043070Z cpu cores : 24 2025-05-07T19:43:05.2043155Z apicid : 23 2025-05-07T19:43:05.2043231Z initial apicid : 23 2025-05-07T19:43:05.2043299Z fpu : yes 2025-05-07T19:43:05.2043376Z fpu_exception : yes 2025-05-07T19:43:05.2043462Z cpuid level : 13 2025-05-07T19:43:05.2043533Z wp : yes 2025-05-07T19:43:05.2045517Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2045926Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2046004Z bogomips : 6000.01 2025-05-07T19:43:05.2046085Z clflush size : 64 2025-05-07T19:43:05.2046160Z cache_alignment : 64 2025-05-07T19:43:05.2046282Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2046357Z power management: 2025-05-07T19:43:05.2046362Z 2025-05-07T19:43:05.2046443Z processor : 60 2025-05-07T19:43:05.2046527Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2046601Z cpu family : 6 2025-05-07T19:43:05.2046681Z model : 85 2025-05-07T19:43:05.2046828Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2046900Z stepping : 7 2025-05-07T19:43:05.2046977Z microcode : 0x5003901 2025-05-07T19:43:05.2047063Z cpu MHz : 3000.006 2025-05-07T19:43:05.2047138Z cache size : 36608 KB 2025-05-07T19:43:05.2047211Z physical id : 0 2025-05-07T19:43:05.2047293Z siblings : 48 2025-05-07T19:43:05.2047363Z core id : 12 2025-05-07T19:43:05.2047433Z cpu cores : 24 2025-05-07T19:43:05.2047502Z apicid : 25 2025-05-07T19:43:05.2047584Z initial apicid : 25 2025-05-07T19:43:05.2047656Z fpu : yes 2025-05-07T19:43:05.2047731Z fpu_exception : yes 2025-05-07T19:43:05.2047803Z cpuid level : 13 2025-05-07T19:43:05.2047884Z wp : yes 2025-05-07T19:43:05.2049908Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2050275Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2050350Z bogomips : 6000.01 2025-05-07T19:43:05.2050426Z clflush size : 64 2025-05-07T19:43:05.2050502Z cache_alignment : 64 2025-05-07T19:43:05.2050625Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2050704Z power management: 2025-05-07T19:43:05.2050709Z 2025-05-07T19:43:05.2050782Z processor : 61 2025-05-07T19:43:05.2050883Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2050966Z cpu family : 6 2025-05-07T19:43:05.2051044Z model : 85 2025-05-07T19:43:05.2051211Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2051290Z stepping : 7 2025-05-07T19:43:05.2051371Z microcode : 0x5003901 2025-05-07T19:43:05.2051449Z cpu MHz : 3000.006 2025-05-07T19:43:05.2051545Z cache size : 36608 KB 2025-05-07T19:43:05.2051627Z physical id : 0 2025-05-07T19:43:05.2051704Z siblings : 48 2025-05-07T19:43:05.2051781Z core id : 13 2025-05-07T19:43:05.2051874Z cpu cores : 24 2025-05-07T19:43:05.2051950Z apicid : 27 2025-05-07T19:43:05.2052031Z initial apicid : 27 2025-05-07T19:43:05.2052121Z fpu : yes 2025-05-07T19:43:05.2052203Z fpu_exception : yes 2025-05-07T19:43:05.2052282Z cpuid level : 13 2025-05-07T19:43:05.2052358Z wp : yes 2025-05-07T19:43:05.2054362Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2054772Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2054868Z bogomips : 6000.01 2025-05-07T19:43:05.2054950Z clflush size : 64 2025-05-07T19:43:05.2055032Z cache_alignment : 64 2025-05-07T19:43:05.2055156Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2055254Z power management: 2025-05-07T19:43:05.2055258Z 2025-05-07T19:43:05.2055342Z processor : 62 2025-05-07T19:43:05.2055428Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2055524Z cpu family : 6 2025-05-07T19:43:05.2055598Z model : 85 2025-05-07T19:43:05.2055753Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2055848Z stepping : 7 2025-05-07T19:43:05.2055931Z microcode : 0x5003901 2025-05-07T19:43:05.2056009Z cpu MHz : 3000.006 2025-05-07T19:43:05.2056090Z cache size : 36608 KB 2025-05-07T19:43:05.2056183Z physical id : 0 2025-05-07T19:43:05.2056262Z siblings : 48 2025-05-07T19:43:05.2056336Z core id : 14 2025-05-07T19:43:05.2056415Z cpu cores : 24 2025-05-07T19:43:05.2056505Z apicid : 29 2025-05-07T19:43:05.2056587Z initial apicid : 29 2025-05-07T19:43:05.2056663Z fpu : yes 2025-05-07T19:43:05.2056757Z fpu_exception : yes 2025-05-07T19:43:05.2056834Z cpuid level : 13 2025-05-07T19:43:05.2056911Z wp : yes 2025-05-07T19:43:05.2058958Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2059322Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2059402Z bogomips : 6000.01 2025-05-07T19:43:05.2059493Z clflush size : 64 2025-05-07T19:43:05.2059574Z cache_alignment : 64 2025-05-07T19:43:05.2059695Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2059774Z power management: 2025-05-07T19:43:05.2059790Z 2025-05-07T19:43:05.2059866Z processor : 63 2025-05-07T19:43:05.2059954Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2060030Z cpu family : 6 2025-05-07T19:43:05.2060181Z model : 85 2025-05-07T19:43:05.2060332Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2060411Z stepping : 7 2025-05-07T19:43:05.2060657Z microcode : 0x5003901 2025-05-07T19:43:05.2060754Z cpu MHz : 1191.810 2025-05-07T19:43:05.2060839Z cache size : 36608 KB 2025-05-07T19:43:05.2060925Z physical id : 0 2025-05-07T19:43:05.2061022Z siblings : 48 2025-05-07T19:43:05.2061102Z core id : 15 2025-05-07T19:43:05.2061184Z cpu cores : 24 2025-05-07T19:43:05.2061339Z apicid : 31 2025-05-07T19:43:05.2061439Z initial apicid : 31 2025-05-07T19:43:05.2061518Z fpu : yes 2025-05-07T19:43:05.2061604Z fpu_exception : yes 2025-05-07T19:43:05.2061700Z cpuid level : 13 2025-05-07T19:43:05.2061777Z wp : yes 2025-05-07T19:43:05.2063923Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2064395Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2064485Z bogomips : 6000.01 2025-05-07T19:43:05.2064570Z clflush size : 64 2025-05-07T19:43:05.2064671Z cache_alignment : 64 2025-05-07T19:43:05.2064805Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2064891Z power management: 2025-05-07T19:43:05.2064895Z 2025-05-07T19:43:05.2064983Z processor : 64 2025-05-07T19:43:05.2065089Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2065174Z cpu family : 6 2025-05-07T19:43:05.2065255Z model : 85 2025-05-07T19:43:05.2065427Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2065514Z stepping : 7 2025-05-07T19:43:05.2065600Z microcode : 0x5003901 2025-05-07T19:43:05.2065682Z cpu MHz : 3000.006 2025-05-07T19:43:05.2065779Z cache size : 36608 KB 2025-05-07T19:43:05.2065862Z physical id : 0 2025-05-07T19:43:05.2065943Z siblings : 48 2025-05-07T19:43:05.2066032Z core id : 16 2025-05-07T19:43:05.2066114Z cpu cores : 24 2025-05-07T19:43:05.2066194Z apicid : 33 2025-05-07T19:43:05.2066279Z initial apicid : 33 2025-05-07T19:43:05.2066370Z fpu : yes 2025-05-07T19:43:05.2066457Z fpu_exception : yes 2025-05-07T19:43:05.2066539Z cpuid level : 13 2025-05-07T19:43:05.2066628Z wp : yes 2025-05-07T19:43:05.2068820Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2069213Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2069310Z bogomips : 6000.01 2025-05-07T19:43:05.2069395Z clflush size : 64 2025-05-07T19:43:05.2069481Z cache_alignment : 64 2025-05-07T19:43:05.2069624Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2069711Z power management: 2025-05-07T19:43:05.2069716Z 2025-05-07T19:43:05.2069798Z processor : 65 2025-05-07T19:43:05.2069890Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2069984Z cpu family : 6 2025-05-07T19:43:05.2070067Z model : 85 2025-05-07T19:43:05.2070226Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2070322Z stepping : 7 2025-05-07T19:43:05.2070410Z microcode : 0x5003901 2025-05-07T19:43:05.2070492Z cpu MHz : 1199.836 2025-05-07T19:43:05.2070578Z cache size : 36608 KB 2025-05-07T19:43:05.2070673Z physical id : 0 2025-05-07T19:43:05.2070754Z siblings : 48 2025-05-07T19:43:05.2070836Z core id : 17 2025-05-07T19:43:05.2070929Z cpu cores : 24 2025-05-07T19:43:05.2071009Z apicid : 35 2025-05-07T19:43:05.2071096Z initial apicid : 35 2025-05-07T19:43:05.2071175Z fpu : yes 2025-05-07T19:43:05.2071274Z fpu_exception : yes 2025-05-07T19:43:05.2071355Z cpuid level : 13 2025-05-07T19:43:05.2071433Z wp : yes 2025-05-07T19:43:05.2073642Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2074048Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2074127Z bogomips : 6000.01 2025-05-07T19:43:05.2074217Z clflush size : 64 2025-05-07T19:43:05.2074296Z cache_alignment : 64 2025-05-07T19:43:05.2074417Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2074510Z power management: 2025-05-07T19:43:05.2074514Z 2025-05-07T19:43:05.2074592Z processor : 66 2025-05-07T19:43:05.2074678Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2074754Z cpu family : 6 2025-05-07T19:43:05.2074841Z model : 85 2025-05-07T19:43:05.2074993Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2075072Z stepping : 7 2025-05-07T19:43:05.2075165Z microcode : 0x5003901 2025-05-07T19:43:05.2075245Z cpu MHz : 3000.006 2025-05-07T19:43:05.2075325Z cache size : 36608 KB 2025-05-07T19:43:05.2075403Z physical id : 0 2025-05-07T19:43:05.2075493Z siblings : 48 2025-05-07T19:43:05.2075568Z core id : 18 2025-05-07T19:43:05.2075644Z cpu cores : 24 2025-05-07T19:43:05.2075731Z apicid : 37 2025-05-07T19:43:05.2075811Z initial apicid : 37 2025-05-07T19:43:05.2076024Z fpu : yes 2025-05-07T19:43:05.2076105Z fpu_exception : yes 2025-05-07T19:43:05.2076197Z cpuid level : 13 2025-05-07T19:43:05.2076441Z wp : yes 2025-05-07T19:43:05.2078684Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2079090Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2079173Z bogomips : 6000.01 2025-05-07T19:43:05.2079257Z clflush size : 64 2025-05-07T19:43:05.2079356Z cache_alignment : 64 2025-05-07T19:43:05.2079487Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2079574Z power management: 2025-05-07T19:43:05.2079578Z 2025-05-07T19:43:05.2079676Z processor : 67 2025-05-07T19:43:05.2079765Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2079846Z cpu family : 6 2025-05-07T19:43:05.2079925Z model : 85 2025-05-07T19:43:05.2080102Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2080184Z stepping : 7 2025-05-07T19:43:05.2080269Z microcode : 0x5003901 2025-05-07T19:43:05.2080362Z cpu MHz : 3000.006 2025-05-07T19:43:05.2080449Z cache size : 36608 KB 2025-05-07T19:43:05.2080533Z physical id : 0 2025-05-07T19:43:05.2080613Z siblings : 48 2025-05-07T19:43:05.2080705Z core id : 19 2025-05-07T19:43:05.2080788Z cpu cores : 24 2025-05-07T19:43:05.2080869Z apicid : 39 2025-05-07T19:43:05.2080954Z initial apicid : 39 2025-05-07T19:43:05.2081045Z fpu : yes 2025-05-07T19:43:05.2081133Z fpu_exception : yes 2025-05-07T19:43:05.2081216Z cpuid level : 13 2025-05-07T19:43:05.2081310Z wp : yes 2025-05-07T19:43:05.2083458Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2083911Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2084011Z bogomips : 6000.01 2025-05-07T19:43:05.2084094Z clflush size : 64 2025-05-07T19:43:05.2084183Z cache_alignment : 64 2025-05-07T19:43:05.2084326Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2084413Z power management: 2025-05-07T19:43:05.2084417Z 2025-05-07T19:43:05.2084499Z processor : 68 2025-05-07T19:43:05.2084601Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2084682Z cpu family : 6 2025-05-07T19:43:05.2084761Z model : 85 2025-05-07T19:43:05.2084922Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2085015Z stepping : 7 2025-05-07T19:43:05.2085104Z microcode : 0x5003901 2025-05-07T19:43:05.2085187Z cpu MHz : 3000.006 2025-05-07T19:43:05.2085285Z cache size : 36608 KB 2025-05-07T19:43:05.2085373Z physical id : 0 2025-05-07T19:43:05.2085454Z siblings : 48 2025-05-07T19:43:05.2085533Z core id : 20 2025-05-07T19:43:05.2085627Z cpu cores : 24 2025-05-07T19:43:05.2085708Z apicid : 41 2025-05-07T19:43:05.2085793Z initial apicid : 41 2025-05-07T19:43:05.2085871Z fpu : yes 2025-05-07T19:43:05.2085971Z fpu_exception : yes 2025-05-07T19:43:05.2086053Z cpuid level : 13 2025-05-07T19:43:05.2086132Z wp : yes 2025-05-07T19:43:05.2088331Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2088934Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2089015Z bogomips : 6000.01 2025-05-07T19:43:05.2089109Z clflush size : 64 2025-05-07T19:43:05.2089190Z cache_alignment : 64 2025-05-07T19:43:05.2089316Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2089413Z power management: 2025-05-07T19:43:05.2089417Z 2025-05-07T19:43:05.2089496Z processor : 69 2025-05-07T19:43:05.2089583Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2089661Z cpu family : 6 2025-05-07T19:43:05.2089753Z model : 85 2025-05-07T19:43:05.2089903Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2089982Z stepping : 7 2025-05-07T19:43:05.2090079Z microcode : 0x5003901 2025-05-07T19:43:05.2090162Z cpu MHz : 1200.476 2025-05-07T19:43:05.2090242Z cache size : 36608 KB 2025-05-07T19:43:05.2090322Z physical id : 0 2025-05-07T19:43:05.2090418Z siblings : 48 2025-05-07T19:43:05.2090497Z core id : 21 2025-05-07T19:43:05.2090575Z cpu cores : 24 2025-05-07T19:43:05.2090668Z apicid : 43 2025-05-07T19:43:05.2090750Z initial apicid : 43 2025-05-07T19:43:05.2090825Z fpu : yes 2025-05-07T19:43:05.2090907Z fpu_exception : yes 2025-05-07T19:43:05.2090999Z cpuid level : 13 2025-05-07T19:43:05.2091074Z wp : yes 2025-05-07T19:43:05.2093060Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2093478Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2093558Z bogomips : 6000.01 2025-05-07T19:43:05.2093639Z clflush size : 64 2025-05-07T19:43:05.2093818Z cache_alignment : 64 2025-05-07T19:43:05.2093941Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2094024Z power management: 2025-05-07T19:43:05.2094028Z 2025-05-07T19:43:05.2094122Z processor : 70 2025-05-07T19:43:05.2094207Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2094286Z cpu family : 6 2025-05-07T19:43:05.2094364Z model : 85 2025-05-07T19:43:05.2094525Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2094604Z stepping : 7 2025-05-07T19:43:05.2094682Z microcode : 0x5003901 2025-05-07T19:43:05.2094776Z cpu MHz : 3000.006 2025-05-07T19:43:05.2094857Z cache size : 36608 KB 2025-05-07T19:43:05.2094935Z physical id : 0 2025-05-07T19:43:05.2095013Z siblings : 48 2025-05-07T19:43:05.2095102Z core id : 22 2025-05-07T19:43:05.2095182Z cpu cores : 24 2025-05-07T19:43:05.2095258Z apicid : 45 2025-05-07T19:43:05.2095351Z initial apicid : 45 2025-05-07T19:43:05.2095423Z fpu : yes 2025-05-07T19:43:05.2095503Z fpu_exception : yes 2025-05-07T19:43:05.2095581Z cpuid level : 13 2025-05-07T19:43:05.2095666Z wp : yes 2025-05-07T19:43:05.2097708Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2098076Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2098159Z bogomips : 6000.01 2025-05-07T19:43:05.2098239Z clflush size : 64 2025-05-07T19:43:05.2098319Z cache_alignment : 64 2025-05-07T19:43:05.2098451Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2098529Z power management: 2025-05-07T19:43:05.2098533Z 2025-05-07T19:43:05.2098609Z processor : 71 2025-05-07T19:43:05.2098703Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2098779Z cpu family : 6 2025-05-07T19:43:05.2098852Z model : 85 2025-05-07T19:43:05.2099000Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2111543Z stepping : 7 2025-05-07T19:43:05.2111692Z microcode : 0x5003901 2025-05-07T19:43:05.2111789Z cpu MHz : 3000.006 2025-05-07T19:43:05.2111873Z cache size : 36608 KB 2025-05-07T19:43:05.2111969Z physical id : 0 2025-05-07T19:43:05.2112048Z siblings : 48 2025-05-07T19:43:05.2112137Z core id : 23 2025-05-07T19:43:05.2112218Z cpu cores : 24 2025-05-07T19:43:05.2112301Z apicid : 47 2025-05-07T19:43:05.2112395Z initial apicid : 47 2025-05-07T19:43:05.2112473Z fpu : yes 2025-05-07T19:43:05.2112557Z fpu_exception : yes 2025-05-07T19:43:05.2112635Z cpuid level : 13 2025-05-07T19:43:05.2112828Z wp : yes 2025-05-07T19:43:05.2114838Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2115206Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2115517Z bogomips : 6000.01 2025-05-07T19:43:05.2115591Z clflush size : 64 2025-05-07T19:43:05.2115669Z cache_alignment : 64 2025-05-07T19:43:05.2115800Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2115879Z power management: 2025-05-07T19:43:05.2115885Z 2025-05-07T19:43:05.2115962Z processor : 72 2025-05-07T19:43:05.2116053Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2116125Z cpu family : 6 2025-05-07T19:43:05.2116196Z model : 85 2025-05-07T19:43:05.2116345Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2116430Z stepping : 7 2025-05-07T19:43:05.2116507Z microcode : 0x5003901 2025-05-07T19:43:05.2116583Z cpu MHz : 3000.006 2025-05-07T19:43:05.2116665Z cache size : 36608 KB 2025-05-07T19:43:05.2116740Z physical id : 1 2025-05-07T19:43:05.2116816Z siblings : 48 2025-05-07T19:43:05.2116885Z core id : 0 2025-05-07T19:43:05.2116963Z cpu cores : 24 2025-05-07T19:43:05.2117032Z apicid : 65 2025-05-07T19:43:05.2117111Z initial apicid : 65 2025-05-07T19:43:05.2117179Z fpu : yes 2025-05-07T19:43:05.2117260Z fpu_exception : yes 2025-05-07T19:43:05.2117332Z cpuid level : 13 2025-05-07T19:43:05.2117402Z wp : yes 2025-05-07T19:43:05.2119391Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2119795Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2119884Z bogomips : 6000.01 2025-05-07T19:43:05.2119958Z clflush size : 64 2025-05-07T19:43:05.2120034Z cache_alignment : 64 2025-05-07T19:43:05.2120155Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2120239Z power management: 2025-05-07T19:43:05.2120244Z 2025-05-07T19:43:05.2120316Z processor : 73 2025-05-07T19:43:05.2120398Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2120477Z cpu family : 6 2025-05-07T19:43:05.2120546Z model : 85 2025-05-07T19:43:05.2120692Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2120766Z stepping : 7 2025-05-07T19:43:05.2120851Z microcode : 0x5003901 2025-05-07T19:43:05.2120922Z cpu MHz : 3239.916 2025-05-07T19:43:05.2120999Z cache size : 36608 KB 2025-05-07T19:43:05.2121081Z physical id : 1 2025-05-07T19:43:05.2121152Z siblings : 48 2025-05-07T19:43:05.2121223Z core id : 1 2025-05-07T19:43:05.2121294Z cpu cores : 24 2025-05-07T19:43:05.2121371Z apicid : 67 2025-05-07T19:43:05.2121445Z initial apicid : 67 2025-05-07T19:43:05.2121518Z fpu : yes 2025-05-07T19:43:05.2121594Z fpu_exception : yes 2025-05-07T19:43:05.2121674Z cpuid level : 13 2025-05-07T19:43:05.2121742Z wp : yes 2025-05-07T19:43:05.2123717Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2124083Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2124158Z bogomips : 6000.01 2025-05-07T19:43:05.2124276Z clflush size : 64 2025-05-07T19:43:05.2124362Z cache_alignment : 64 2025-05-07T19:43:05.2124478Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2124555Z power management: 2025-05-07T19:43:05.2124559Z 2025-05-07T19:43:05.2124639Z processor : 74 2025-05-07T19:43:05.2124718Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2124789Z cpu family : 6 2025-05-07T19:43:05.2124856Z model : 85 2025-05-07T19:43:05.2125005Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2125079Z stepping : 7 2025-05-07T19:43:05.2125154Z microcode : 0x5003901 2025-05-07T19:43:05.2125232Z cpu MHz : 3174.584 2025-05-07T19:43:05.2125307Z cache size : 36608 KB 2025-05-07T19:43:05.2125379Z physical id : 1 2025-05-07T19:43:05.2125451Z siblings : 48 2025-05-07T19:43:05.2125528Z core id : 2 2025-05-07T19:43:05.2125603Z cpu cores : 24 2025-05-07T19:43:05.2125673Z apicid : 69 2025-05-07T19:43:05.2125754Z initial apicid : 69 2025-05-07T19:43:05.2125822Z fpu : yes 2025-05-07T19:43:05.2125896Z fpu_exception : yes 2025-05-07T19:43:05.2125971Z cpuid level : 13 2025-05-07T19:43:05.2126048Z wp : yes 2025-05-07T19:43:05.2128023Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2128428Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2128503Z bogomips : 6000.01 2025-05-07T19:43:05.2128575Z clflush size : 64 2025-05-07T19:43:05.2128651Z cache_alignment : 64 2025-05-07T19:43:05.2128779Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2128854Z power management: 2025-05-07T19:43:05.2128858Z 2025-05-07T19:43:05.2128929Z processor : 75 2025-05-07T19:43:05.2129015Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2129085Z cpu family : 6 2025-05-07T19:43:05.2129155Z model : 85 2025-05-07T19:43:05.2129296Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2129372Z stepping : 7 2025-05-07T19:43:05.2129446Z microcode : 0x5003901 2025-05-07T19:43:05.2129516Z cpu MHz : 3212.400 2025-05-07T19:43:05.2129596Z cache size : 36608 KB 2025-05-07T19:43:05.2129666Z physical id : 1 2025-05-07T19:43:05.2129737Z siblings : 48 2025-05-07T19:43:05.2129803Z core id : 3 2025-05-07T19:43:05.2129880Z cpu cores : 24 2025-05-07T19:43:05.2129948Z apicid : 71 2025-05-07T19:43:05.2130027Z initial apicid : 71 2025-05-07T19:43:05.2130105Z fpu : yes 2025-05-07T19:43:05.2130182Z fpu_exception : yes 2025-05-07T19:43:05.2130253Z cpuid level : 13 2025-05-07T19:43:05.2130325Z wp : yes 2025-05-07T19:43:05.2132315Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2132665Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2132749Z bogomips : 6000.01 2025-05-07T19:43:05.2132822Z clflush size : 64 2025-05-07T19:43:05.2132898Z cache_alignment : 64 2025-05-07T19:43:05.2133013Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2133143Z power management: 2025-05-07T19:43:05.2133147Z 2025-05-07T19:43:05.2133219Z processor : 76 2025-05-07T19:43:05.2133297Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2133373Z cpu family : 6 2025-05-07T19:43:05.2133441Z model : 85 2025-05-07T19:43:05.2133583Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2133652Z stepping : 7 2025-05-07T19:43:05.2133731Z microcode : 0x5003901 2025-05-07T19:43:05.2133801Z cpu MHz : 3221.299 2025-05-07T19:43:05.2133873Z cache size : 36608 KB 2025-05-07T19:43:05.2133952Z physical id : 1 2025-05-07T19:43:05.2134021Z siblings : 48 2025-05-07T19:43:05.2134088Z core id : 4 2025-05-07T19:43:05.2134157Z cpu cores : 24 2025-05-07T19:43:05.2134231Z apicid : 73 2025-05-07T19:43:05.2134302Z initial apicid : 73 2025-05-07T19:43:05.2134374Z fpu : yes 2025-05-07T19:43:05.2134455Z fpu_exception : yes 2025-05-07T19:43:05.2134525Z cpuid level : 13 2025-05-07T19:43:05.2134593Z wp : yes 2025-05-07T19:43:05.2136580Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2136931Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2137002Z bogomips : 6000.01 2025-05-07T19:43:05.2137126Z clflush size : 64 2025-05-07T19:43:05.2137202Z cache_alignment : 64 2025-05-07T19:43:05.2137314Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2137393Z power management: 2025-05-07T19:43:05.2137397Z 2025-05-07T19:43:05.2137475Z processor : 77 2025-05-07T19:43:05.2137553Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2137623Z cpu family : 6 2025-05-07T19:43:05.2137695Z model : 85 2025-05-07T19:43:05.2137836Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2137906Z stepping : 7 2025-05-07T19:43:05.2137980Z microcode : 0x5003901 2025-05-07T19:43:05.2138055Z cpu MHz : 3000.006 2025-05-07T19:43:05.2138126Z cache size : 36608 KB 2025-05-07T19:43:05.2138198Z physical id : 1 2025-05-07T19:43:05.2138271Z siblings : 48 2025-05-07T19:43:05.2138339Z core id : 5 2025-05-07T19:43:05.2138409Z cpu cores : 24 2025-05-07T19:43:05.2138474Z apicid : 75 2025-05-07T19:43:05.2138555Z initial apicid : 75 2025-05-07T19:43:05.2138624Z fpu : yes 2025-05-07T19:43:05.2138702Z fpu_exception : yes 2025-05-07T19:43:05.2138771Z cpuid level : 13 2025-05-07T19:43:05.2138848Z wp : yes 2025-05-07T19:43:05.2141111Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2141507Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2141590Z bogomips : 6000.01 2025-05-07T19:43:05.2141666Z clflush size : 64 2025-05-07T19:43:05.2141754Z cache_alignment : 64 2025-05-07T19:43:05.2141878Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2141959Z power management: 2025-05-07T19:43:05.2142017Z 2025-05-07T19:43:05.2142093Z processor : 78 2025-05-07T19:43:05.2142185Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2142260Z cpu family : 6 2025-05-07T19:43:05.2142333Z model : 85 2025-05-07T19:43:05.2142495Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2142570Z stepping : 7 2025-05-07T19:43:05.2142650Z microcode : 0x5003901 2025-05-07T19:43:05.2142728Z cpu MHz : 3246.056 2025-05-07T19:43:05.2142815Z cache size : 36608 KB 2025-05-07T19:43:05.2142893Z physical id : 1 2025-05-07T19:43:05.2142967Z siblings : 48 2025-05-07T19:43:05.2143045Z core id : 6 2025-05-07T19:43:05.2143121Z cpu cores : 24 2025-05-07T19:43:05.2143196Z apicid : 77 2025-05-07T19:43:05.2143276Z initial apicid : 77 2025-05-07T19:43:05.2143353Z fpu : yes 2025-05-07T19:43:05.2143433Z fpu_exception : yes 2025-05-07T19:43:05.2143513Z cpuid level : 13 2025-05-07T19:43:05.2143585Z wp : yes 2025-05-07T19:43:05.2145730Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2146116Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2146200Z bogomips : 6000.01 2025-05-07T19:43:05.2146278Z clflush size : 64 2025-05-07T19:43:05.2146361Z cache_alignment : 64 2025-05-07T19:43:05.2146548Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2146634Z power management: 2025-05-07T19:43:05.2146639Z 2025-05-07T19:43:05.2146714Z processor : 79 2025-05-07T19:43:05.2146804Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2146890Z cpu family : 6 2025-05-07T19:43:05.2146963Z model : 85 2025-05-07T19:43:05.2147117Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2147204Z stepping : 7 2025-05-07T19:43:05.2147285Z microcode : 0x5003901 2025-05-07T19:43:05.2147360Z cpu MHz : 3294.119 2025-05-07T19:43:05.2147442Z cache size : 36608 KB 2025-05-07T19:43:05.2147532Z physical id : 1 2025-05-07T19:43:05.2147608Z siblings : 48 2025-05-07T19:43:05.2147681Z core id : 7 2025-05-07T19:43:05.2147759Z cpu cores : 24 2025-05-07T19:43:05.2147843Z apicid : 79 2025-05-07T19:43:05.2147924Z initial apicid : 79 2025-05-07T19:43:05.2147997Z fpu : yes 2025-05-07T19:43:05.2148088Z fpu_exception : yes 2025-05-07T19:43:05.2148163Z cpuid level : 13 2025-05-07T19:43:05.2148239Z wp : yes 2025-05-07T19:43:05.2150394Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2150782Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2150862Z bogomips : 6000.01 2025-05-07T19:43:05.2150946Z clflush size : 64 2025-05-07T19:43:05.2151027Z cache_alignment : 64 2025-05-07T19:43:05.2151150Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2151231Z power management: 2025-05-07T19:43:05.2151242Z 2025-05-07T19:43:05.2151324Z processor : 80 2025-05-07T19:43:05.2151408Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2151531Z cpu family : 6 2025-05-07T19:43:05.2151611Z model : 85 2025-05-07T19:43:05.2151764Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2151840Z stepping : 7 2025-05-07T19:43:05.2151919Z microcode : 0x5003901 2025-05-07T19:43:05.2151997Z cpu MHz : 3215.091 2025-05-07T19:43:05.2152076Z cache size : 36608 KB 2025-05-07T19:43:05.2152153Z physical id : 1 2025-05-07T19:43:05.2152235Z siblings : 48 2025-05-07T19:43:05.2152309Z core id : 8 2025-05-07T19:43:05.2152383Z cpu cores : 24 2025-05-07T19:43:05.2152455Z apicid : 81 2025-05-07T19:43:05.2152542Z initial apicid : 81 2025-05-07T19:43:05.2152618Z fpu : yes 2025-05-07T19:43:05.2152697Z fpu_exception : yes 2025-05-07T19:43:05.2152886Z cpuid level : 13 2025-05-07T19:43:05.2152953Z wp : yes 2025-05-07T19:43:05.2154930Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2155289Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2155362Z bogomips : 6000.01 2025-05-07T19:43:05.2155432Z clflush size : 64 2025-05-07T19:43:05.2155511Z cache_alignment : 64 2025-05-07T19:43:05.2155624Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2155699Z power management: 2025-05-07T19:43:05.2155750Z 2025-05-07T19:43:05.2155824Z processor : 81 2025-05-07T19:43:05.2155911Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2155983Z cpu family : 6 2025-05-07T19:43:05.2156054Z model : 85 2025-05-07T19:43:05.2156203Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2156274Z stepping : 7 2025-05-07T19:43:05.2156348Z microcode : 0x5003901 2025-05-07T19:43:05.2156416Z cpu MHz : 3210.751 2025-05-07T19:43:05.2156491Z cache size : 36608 KB 2025-05-07T19:43:05.2156563Z physical id : 1 2025-05-07T19:43:05.2156632Z siblings : 48 2025-05-07T19:43:05.2156704Z core id : 9 2025-05-07T19:43:05.2156772Z cpu cores : 24 2025-05-07T19:43:05.2156838Z apicid : 83 2025-05-07T19:43:05.2156912Z initial apicid : 83 2025-05-07T19:43:05.2156987Z fpu : yes 2025-05-07T19:43:05.2157061Z fpu_exception : yes 2025-05-07T19:43:05.2157131Z cpuid level : 13 2025-05-07T19:43:05.2157201Z wp : yes 2025-05-07T19:43:05.2159188Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2159546Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2159627Z bogomips : 6000.01 2025-05-07T19:43:05.2159699Z clflush size : 64 2025-05-07T19:43:05.2159773Z cache_alignment : 64 2025-05-07T19:43:05.2159891Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2159964Z power management: 2025-05-07T19:43:05.2159968Z 2025-05-07T19:43:05.2160044Z processor : 82 2025-05-07T19:43:05.2160124Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2160200Z cpu family : 6 2025-05-07T19:43:05.2160268Z model : 85 2025-05-07T19:43:05.2160457Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2160531Z stepping : 7 2025-05-07T19:43:05.2160605Z microcode : 0x5003901 2025-05-07T19:43:05.2160677Z cpu MHz : 3188.813 2025-05-07T19:43:05.2160749Z cache size : 36608 KB 2025-05-07T19:43:05.2160831Z physical id : 1 2025-05-07T19:43:05.2160900Z siblings : 48 2025-05-07T19:43:05.2160967Z core id : 10 2025-05-07T19:43:05.2161042Z cpu cores : 24 2025-05-07T19:43:05.2161109Z apicid : 85 2025-05-07T19:43:05.2161181Z initial apicid : 85 2025-05-07T19:43:05.2161247Z fpu : yes 2025-05-07T19:43:05.2161323Z fpu_exception : yes 2025-05-07T19:43:05.2161394Z cpuid level : 13 2025-05-07T19:43:05.2161460Z wp : yes 2025-05-07T19:43:05.2163440Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2163795Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2163866Z bogomips : 6000.01 2025-05-07T19:43:05.2163944Z clflush size : 64 2025-05-07T19:43:05.2164018Z cache_alignment : 64 2025-05-07T19:43:05.2164133Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2164213Z power management: 2025-05-07T19:43:05.2164217Z 2025-05-07T19:43:05.2164286Z processor : 83 2025-05-07T19:43:05.2164413Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2164485Z cpu family : 6 2025-05-07T19:43:05.2164559Z model : 85 2025-05-07T19:43:05.2164700Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2164775Z stepping : 7 2025-05-07T19:43:05.2164853Z microcode : 0x5003901 2025-05-07T19:43:05.2164923Z cpu MHz : 3242.644 2025-05-07T19:43:05.2164991Z cache size : 36608 KB 2025-05-07T19:43:05.2165075Z physical id : 1 2025-05-07T19:43:05.2165143Z siblings : 48 2025-05-07T19:43:05.2165211Z core id : 11 2025-05-07T19:43:05.2165280Z cpu cores : 24 2025-05-07T19:43:05.2165359Z apicid : 87 2025-05-07T19:43:05.2165433Z initial apicid : 87 2025-05-07T19:43:05.2165504Z fpu : yes 2025-05-07T19:43:05.2165587Z fpu_exception : yes 2025-05-07T19:43:05.2165659Z cpuid level : 13 2025-05-07T19:43:05.2165728Z wp : yes 2025-05-07T19:43:05.2167717Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2168074Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2168149Z bogomips : 6000.01 2025-05-07T19:43:05.2168235Z clflush size : 64 2025-05-07T19:43:05.2168308Z cache_alignment : 64 2025-05-07T19:43:05.2168425Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2168501Z power management: 2025-05-07T19:43:05.2168505Z 2025-05-07T19:43:05.2168584Z processor : 84 2025-05-07T19:43:05.2168664Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2168741Z cpu family : 6 2025-05-07T19:43:05.2168815Z model : 85 2025-05-07T19:43:05.2168957Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2169031Z stepping : 7 2025-05-07T19:43:05.2169589Z microcode : 0x5003901 2025-05-07T19:43:05.2169667Z cpu MHz : 3166.306 2025-05-07T19:43:05.2169742Z cache size : 36608 KB 2025-05-07T19:43:05.2169816Z physical id : 1 2025-05-07T19:43:05.2169895Z siblings : 48 2025-05-07T19:43:05.2169965Z core id : 12 2025-05-07T19:43:05.2170037Z cpu cores : 24 2025-05-07T19:43:05.2170106Z apicid : 89 2025-05-07T19:43:05.2170191Z initial apicid : 89 2025-05-07T19:43:05.2170262Z fpu : yes 2025-05-07T19:43:05.2170340Z fpu_exception : yes 2025-05-07T19:43:05.2170417Z cpuid level : 13 2025-05-07T19:43:05.2170487Z wp : yes 2025-05-07T19:43:05.2172468Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2172831Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2172905Z bogomips : 6000.01 2025-05-07T19:43:05.2172979Z clflush size : 64 2025-05-07T19:43:05.2173063Z cache_alignment : 64 2025-05-07T19:43:05.2173179Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2173256Z power management: 2025-05-07T19:43:05.2173260Z 2025-05-07T19:43:05.2173336Z processor : 85 2025-05-07T19:43:05.2173424Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2173496Z cpu family : 6 2025-05-07T19:43:05.2173568Z model : 85 2025-05-07T19:43:05.2173766Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2173844Z stepping : 7 2025-05-07T19:43:05.2173920Z microcode : 0x5003901 2025-05-07T19:43:05.2173997Z cpu MHz : 3779.951 2025-05-07T19:43:05.2174082Z cache size : 36608 KB 2025-05-07T19:43:05.2174155Z physical id : 1 2025-05-07T19:43:05.2174224Z siblings : 48 2025-05-07T19:43:05.2174296Z core id : 13 2025-05-07T19:43:05.2174365Z cpu cores : 24 2025-05-07T19:43:05.2174433Z apicid : 91 2025-05-07T19:43:05.2174504Z initial apicid : 91 2025-05-07T19:43:05.2174579Z fpu : yes 2025-05-07T19:43:05.2174653Z fpu_exception : yes 2025-05-07T19:43:05.2174721Z cpuid level : 13 2025-05-07T19:43:05.2174794Z wp : yes 2025-05-07T19:43:05.2177142Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2177528Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2177614Z bogomips : 6000.01 2025-05-07T19:43:05.2177690Z clflush size : 64 2025-05-07T19:43:05.2177769Z cache_alignment : 64 2025-05-07T19:43:05.2177899Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2177978Z power management: 2025-05-07T19:43:05.2177983Z 2025-05-07T19:43:05.2178060Z processor : 86 2025-05-07T19:43:05.2178145Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2178227Z cpu family : 6 2025-05-07T19:43:05.2178299Z model : 85 2025-05-07T19:43:05.2178461Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2178543Z stepping : 7 2025-05-07T19:43:05.2178621Z microcode : 0x5003901 2025-05-07T19:43:05.2178696Z cpu MHz : 3132.431 2025-05-07T19:43:05.2178927Z cache size : 36608 KB 2025-05-07T19:43:05.2179011Z physical id : 1 2025-05-07T19:43:05.2179085Z siblings : 48 2025-05-07T19:43:05.2179160Z core id : 14 2025-05-07T19:43:05.2179239Z cpu cores : 24 2025-05-07T19:43:05.2179313Z apicid : 93 2025-05-07T19:43:05.2179391Z initial apicid : 93 2025-05-07T19:43:05.2179464Z fpu : yes 2025-05-07T19:43:05.2179551Z fpu_exception : yes 2025-05-07T19:43:05.2179630Z cpuid level : 13 2025-05-07T19:43:05.2179704Z wp : yes 2025-05-07T19:43:05.2181930Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2182316Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2182397Z bogomips : 6000.01 2025-05-07T19:43:05.2182484Z clflush size : 64 2025-05-07T19:43:05.2182565Z cache_alignment : 64 2025-05-07T19:43:05.2182693Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2182782Z power management: 2025-05-07T19:43:05.2182787Z 2025-05-07T19:43:05.2182865Z processor : 87 2025-05-07T19:43:05.2182953Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2183033Z cpu family : 6 2025-05-07T19:43:05.2183117Z model : 85 2025-05-07T19:43:05.2183274Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2183425Z stepping : 7 2025-05-07T19:43:05.2183514Z microcode : 0x5003901 2025-05-07T19:43:05.2183589Z cpu MHz : 3243.970 2025-05-07T19:43:05.2183669Z cache size : 36608 KB 2025-05-07T19:43:05.2183756Z physical id : 1 2025-05-07T19:43:05.2183841Z siblings : 48 2025-05-07T19:43:05.2183919Z core id : 15 2025-05-07T19:43:05.2183996Z cpu cores : 24 2025-05-07T19:43:05.2184081Z apicid : 95 2025-05-07T19:43:05.2184165Z initial apicid : 95 2025-05-07T19:43:05.2184240Z fpu : yes 2025-05-07T19:43:05.2184324Z fpu_exception : yes 2025-05-07T19:43:05.2184412Z cpuid level : 13 2025-05-07T19:43:05.2184488Z wp : yes 2025-05-07T19:43:05.2186643Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2187034Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2187114Z bogomips : 6000.01 2025-05-07T19:43:05.2187197Z clflush size : 64 2025-05-07T19:43:05.2187286Z cache_alignment : 64 2025-05-07T19:43:05.2187410Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2187491Z power management: 2025-05-07T19:43:05.2187495Z 2025-05-07T19:43:05.2187580Z processor : 88 2025-05-07T19:43:05.2187668Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2187747Z cpu family : 6 2025-05-07T19:43:05.2187821Z model : 85 2025-05-07T19:43:05.2187983Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2188060Z stepping : 7 2025-05-07T19:43:05.2188145Z microcode : 0x5003901 2025-05-07T19:43:05.2188230Z cpu MHz : 3208.326 2025-05-07T19:43:05.2188310Z cache size : 36608 KB 2025-05-07T19:43:05.2188388Z physical id : 1 2025-05-07T19:43:05.2188516Z siblings : 48 2025-05-07T19:43:05.2188599Z core id : 16 2025-05-07T19:43:05.2188676Z cpu cores : 24 2025-05-07T19:43:05.2188752Z apicid : 97 2025-05-07T19:43:05.2188830Z initial apicid : 97 2025-05-07T19:43:05.2188916Z fpu : yes 2025-05-07T19:43:05.2188997Z fpu_exception : yes 2025-05-07T19:43:05.2189076Z cpuid level : 13 2025-05-07T19:43:05.2189154Z wp : yes 2025-05-07T19:43:05.2191303Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2191691Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2191775Z bogomips : 6000.01 2025-05-07T19:43:05.2191850Z clflush size : 64 2025-05-07T19:43:05.2191930Z cache_alignment : 64 2025-05-07T19:43:05.2192178Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2192254Z power management: 2025-05-07T19:43:05.2192258Z 2025-05-07T19:43:05.2192328Z processor : 89 2025-05-07T19:43:05.2192413Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2192481Z cpu family : 6 2025-05-07T19:43:05.2192549Z model : 85 2025-05-07T19:43:05.2192690Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2192768Z stepping : 7 2025-05-07T19:43:05.2192841Z microcode : 0x5003901 2025-05-07T19:43:05.2192957Z cpu MHz : 3242.986 2025-05-07T19:43:05.2193034Z cache size : 36608 KB 2025-05-07T19:43:05.2193111Z physical id : 1 2025-05-07T19:43:05.2193178Z siblings : 48 2025-05-07T19:43:05.2193250Z core id : 17 2025-05-07T19:43:05.2193326Z cpu cores : 24 2025-05-07T19:43:05.2193399Z apicid : 99 2025-05-07T19:43:05.2193474Z initial apicid : 99 2025-05-07T19:43:05.2193541Z fpu : yes 2025-05-07T19:43:05.2193627Z fpu_exception : yes 2025-05-07T19:43:05.2193700Z cpuid level : 13 2025-05-07T19:43:05.2193770Z wp : yes 2025-05-07T19:43:05.2195755Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2196109Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2196191Z bogomips : 6000.01 2025-05-07T19:43:05.2196275Z clflush size : 64 2025-05-07T19:43:05.2196349Z cache_alignment : 64 2025-05-07T19:43:05.2196463Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2196546Z power management: 2025-05-07T19:43:05.2196550Z 2025-05-07T19:43:05.2196620Z processor : 90 2025-05-07T19:43:05.2196700Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2196771Z cpu family : 6 2025-05-07T19:43:05.2196845Z model : 85 2025-05-07T19:43:05.2196987Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2197062Z stepping : 7 2025-05-07T19:43:05.2197144Z microcode : 0x5003901 2025-05-07T19:43:05.2197214Z cpu MHz : 3000.006 2025-05-07T19:43:05.2197289Z cache size : 36608 KB 2025-05-07T19:43:05.2197359Z physical id : 1 2025-05-07T19:43:05.2197433Z siblings : 48 2025-05-07T19:43:05.2197500Z core id : 18 2025-05-07T19:43:05.2197618Z cpu cores : 24 2025-05-07T19:43:05.2197692Z apicid : 101 2025-05-07T19:43:05.2197765Z initial apicid : 101 2025-05-07T19:43:05.2197832Z fpu : yes 2025-05-07T19:43:05.2197904Z fpu_exception : yes 2025-05-07T19:43:05.2197982Z cpuid level : 13 2025-05-07T19:43:05.2198053Z wp : yes 2025-05-07T19:43:05.2200030Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2200390Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2200470Z bogomips : 6000.01 2025-05-07T19:43:05.2200543Z clflush size : 64 2025-05-07T19:43:05.2200625Z cache_alignment : 64 2025-05-07T19:43:05.2200741Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2200816Z power management: 2025-05-07T19:43:05.2200820Z 2025-05-07T19:43:05.2200897Z processor : 91 2025-05-07T19:43:05.2200981Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2201054Z cpu family : 6 2025-05-07T19:43:05.2201125Z model : 85 2025-05-07T19:43:05.2201273Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2201345Z stepping : 7 2025-05-07T19:43:05.2201423Z microcode : 0x5003901 2025-05-07T19:43:05.2201502Z cpu MHz : 3212.324 2025-05-07T19:43:05.2201578Z cache size : 36608 KB 2025-05-07T19:43:05.2201703Z physical id : 1 2025-05-07T19:43:05.2201776Z siblings : 48 2025-05-07T19:43:05.2201853Z core id : 19 2025-05-07T19:43:05.2201926Z cpu cores : 24 2025-05-07T19:43:05.2202000Z apicid : 103 2025-05-07T19:43:05.2202082Z initial apicid : 103 2025-05-07T19:43:05.2202151Z fpu : yes 2025-05-07T19:43:05.2202226Z fpu_exception : yes 2025-05-07T19:43:05.2202297Z cpuid level : 13 2025-05-07T19:43:05.2202372Z wp : yes 2025-05-07T19:43:05.2204351Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2204715Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2204791Z bogomips : 6000.01 2025-05-07T19:43:05.2204865Z clflush size : 64 2025-05-07T19:43:05.2204944Z cache_alignment : 64 2025-05-07T19:43:05.2205066Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2205139Z power management: 2025-05-07T19:43:05.2205143Z 2025-05-07T19:43:05.2205217Z processor : 92 2025-05-07T19:43:05.2205300Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2205371Z cpu family : 6 2025-05-07T19:43:05.2205441Z model : 85 2025-05-07T19:43:05.2205581Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2205662Z stepping : 7 2025-05-07T19:43:05.2205738Z microcode : 0x5003901 2025-05-07T19:43:05.2205811Z cpu MHz : 3211.931 2025-05-07T19:43:05.2205889Z cache size : 36608 KB 2025-05-07T19:43:05.2205962Z physical id : 1 2025-05-07T19:43:05.2206037Z siblings : 48 2025-05-07T19:43:05.2206107Z core id : 20 2025-05-07T19:43:05.2206185Z cpu cores : 24 2025-05-07T19:43:05.2206253Z apicid : 105 2025-05-07T19:43:05.2206327Z initial apicid : 105 2025-05-07T19:43:05.2206450Z fpu : yes 2025-05-07T19:43:05.2206525Z fpu_exception : yes 2025-05-07T19:43:05.2206596Z cpuid level : 13 2025-05-07T19:43:05.2206663Z wp : yes 2025-05-07T19:43:05.2208652Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2209009Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2209088Z bogomips : 6000.01 2025-05-07T19:43:05.2209164Z clflush size : 64 2025-05-07T19:43:05.2209239Z cache_alignment : 64 2025-05-07T19:43:05.2209352Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2209430Z power management: 2025-05-07T19:43:05.2209434Z 2025-05-07T19:43:05.2209502Z processor : 93 2025-05-07T19:43:05.2209579Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2209655Z cpu family : 6 2025-05-07T19:43:05.2209723Z model : 85 2025-05-07T19:43:05.2209864Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2209934Z stepping : 7 2025-05-07T19:43:05.2210013Z microcode : 0x5003901 2025-05-07T19:43:05.2210083Z cpu MHz : 3212.651 2025-05-07T19:43:05.2210154Z cache size : 36608 KB 2025-05-07T19:43:05.2210229Z physical id : 1 2025-05-07T19:43:05.2210297Z siblings : 48 2025-05-07T19:43:05.2210365Z core id : 21 2025-05-07T19:43:05.2210486Z cpu cores : 24 2025-05-07T19:43:05.2210575Z apicid : 107 2025-05-07T19:43:05.2210660Z initial apicid : 107 2025-05-07T19:43:05.2210734Z fpu : yes 2025-05-07T19:43:05.2210829Z fpu_exception : yes 2025-05-07T19:43:05.2210906Z cpuid level : 13 2025-05-07T19:43:05.2210978Z wp : yes 2025-05-07T19:43:05.2212957Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2213329Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2213402Z bogomips : 6000.01 2025-05-07T19:43:05.2213494Z clflush size : 64 2025-05-07T19:43:05.2213579Z cache_alignment : 64 2025-05-07T19:43:05.2213703Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2213782Z power management: 2025-05-07T19:43:05.2213786Z 2025-05-07T19:43:05.2213876Z processor : 94 2025-05-07T19:43:05.2213965Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2214045Z cpu family : 6 2025-05-07T19:43:05.2214127Z model : 85 2025-05-07T19:43:05.2214274Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2214351Z stepping : 7 2025-05-07T19:43:05.2214428Z microcode : 0x5003901 2025-05-07T19:43:05.2214514Z cpu MHz : 3241.632 2025-05-07T19:43:05.2214593Z cache size : 36608 KB 2025-05-07T19:43:05.2214669Z physical id : 1 2025-05-07T19:43:05.2214756Z siblings : 48 2025-05-07T19:43:05.2214830Z core id : 22 2025-05-07T19:43:05.2214906Z cpu cores : 24 2025-05-07T19:43:05.2214985Z apicid : 109 2025-05-07T19:43:05.2215075Z initial apicid : 109 2025-05-07T19:43:05.2215148Z fpu : yes 2025-05-07T19:43:05.2215229Z fpu_exception : yes 2025-05-07T19:43:05.2215367Z cpuid level : 13 2025-05-07T19:43:05.2215451Z wp : yes 2025-05-07T19:43:05.2217426Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2217801Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2217878Z bogomips : 6000.01 2025-05-07T19:43:05.2217954Z clflush size : 64 2025-05-07T19:43:05.2218038Z cache_alignment : 64 2025-05-07T19:43:05.2218162Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2218244Z power management: 2025-05-07T19:43:05.2218248Z 2025-05-07T19:43:05.2218326Z processor : 95 2025-05-07T19:43:05.2218417Z vendor_id : GenuineIntel 2025-05-07T19:43:05.2218494Z cpu family : 6 2025-05-07T19:43:05.2218563Z model : 85 2025-05-07T19:43:05.2218716Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:43:05.2218792Z stepping : 7 2025-05-07T19:43:05.2218867Z microcode : 0x5003901 2025-05-07T19:43:05.2218941Z cpu MHz : 3260.223 2025-05-07T19:43:05.2219023Z cache size : 36608 KB 2025-05-07T19:43:05.2219096Z physical id : 1 2025-05-07T19:43:05.2219169Z siblings : 48 2025-05-07T19:43:05.2219249Z core id : 23 2025-05-07T19:43:05.2219320Z cpu cores : 24 2025-05-07T19:43:05.2219392Z apicid : 111 2025-05-07T19:43:05.2219524Z initial apicid : 111 2025-05-07T19:43:05.2219606Z fpu : yes 2025-05-07T19:43:05.2219684Z fpu_exception : yes 2025-05-07T19:43:05.2219755Z cpuid level : 13 2025-05-07T19:43:05.2219830Z wp : yes 2025-05-07T19:43:05.2222164Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 
2025-05-07T19:43:05.2222553Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:43:05.2222648Z bogomips : 6000.01 2025-05-07T19:43:05.2222730Z clflush size : 64 2025-05-07T19:43:05.2222815Z cache_alignment : 64 2025-05-07T19:43:05.2222945Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:43:05.2223042Z power management: 2025-05-07T19:43:05.2223046Z 2025-05-07T19:43:05.2223050Z 2025-05-07T19:43:05.2223171Z ################################################################################ 2025-05-07T19:43:05.2223266Z [INFO] Print PCI info ... 2025-05-07T19:43:05.2223364Z + lspci -v 2025-05-07T19:43:05.2223369Z 2025-05-07T19:43:05.2223549Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:43:05.2223655Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:43:05.2223781Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:43:05.2223786Z 2025-05-07T19:43:05.2223985Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:43:05.2224066Z Physical Slot: 1 2025-05-07T19:43:05.2224189Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:05.2224194Z 2025-05-07T19:43:05.2224448Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:43:05.2224588Z Physical Slot: 1 2025-05-07T19:43:05.2224726Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:43:05.2224731Z 2025-05-07T19:43:05.2225000Z 00:03.0 VGA compatible controller: Amazon.com, Inc. 
Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:43:05.2225087Z Physical Slot: 3 2025-05-07T19:43:05.2225202Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:05.2225344Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:43:05.2225471Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:43:05.2225475Z 2025-05-07T19:43:05.2225789Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:43:05.2225908Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:43:05.2225993Z Physical Slot: 4 2025-05-07T19:43:05.2226122Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:43:05.2226284Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:05.2226384Z Capabilities: 2025-05-07T19:43:05.2226478Z Kernel driver in use: nvme 2025-05-07T19:43:05.2226483Z 2025-05-07T19:43:05.2226699Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:43:05.2226788Z Physical Slot: 5 2025-05-07T19:43:05.2226896Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:43:05.2227044Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:43:05.2227180Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:43:05.2227324Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:43:05.2227416Z Capabilities: 2025-05-07T19:43:05.2227511Z Kernel driver in use: ena 2025-05-07T19:43:05.2227515Z 2025-05-07T19:43:05.2227519Z 2025-05-07T19:43:05.2227682Z ################################################################################ 2025-05-07T19:43:05.2227791Z [INFO] Print Linux distribution info ... 
2025-05-07T19:43:05.2227881Z + uname -a 2025-05-07T19:43:05.2227890Z 2025-05-07T19:43:05.2228283Z Linux bd0f6f446662 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:43:05.2228289Z 2025-05-07T19:43:05.2228366Z + uname -m 2025-05-07T19:43:05.2228370Z 2025-05-07T19:43:05.2228454Z x86_64 2025-05-07T19:43:05.2228459Z 2025-05-07T19:43:05.2228541Z + cat /proc/version 2025-05-07T19:43:05.2228545Z 2025-05-07T19:43:05.2229139Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:43:05.2229144Z 2025-05-07T19:43:05.2229235Z + cat /etc/os-release 2025-05-07T19:43:05.2229239Z 2025-05-07T19:43:05.2229320Z NAME="Amazon Linux" 2025-05-07T19:43:05.2229407Z VERSION="2023" 2025-05-07T19:43:05.2229490Z ID="amzn" 2025-05-07T19:43:05.2229574Z ID_LIKE="fedora" 2025-05-07T19:43:05.2229653Z VERSION_ID="2023" 2025-05-07T19:43:05.2229752Z PLATFORM_ID="platform:al2023" 2025-05-07T19:43:05.2229877Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:43:05.2229956Z ANSI_COLOR="0;33" 2025-05-07T19:43:05.2230073Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:43:05.2230250Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:43:05.2230428Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:43:05.2230583Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:43:05.2230774Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:43:05.2230862Z VENDOR_NAME="AWS" 2025-05-07T19:43:05.2230970Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:43:05.2231060Z SUPPORT_END="2029-06-30" 2025-05-07T19:43:05.2231064Z 2025-05-07T19:43:05.2272224Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:43:05.2272366Z . 
$PRELUDE; print_gpu_info 2025-05-07T19:43:05.2272635Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:05.2272706Z env: 2025-05-07T19:43:05.2273014Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:05.2273094Z BUILD_ENV: build_binary 2025-05-07T19:43:05.2273175Z BUILD_TARGET: default 2025-05-07T19:43:05.2273249Z BUILD_VARIANT: cuda 2025-05-07T19:43:05.2273330Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:05.2273410Z ##[endgroup] 2025-05-07T19:43:05.6256842Z ################################################################################ 2025-05-07T19:43:05.6257276Z [INFO] Printing general display info ... 2025-05-07T19:43:05.6270529Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:05.7142439Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:05.7146795Z /usr/bin/sudo 2025-05-07T19:43:05.7155037Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:05.7171310Z /usr/bin/yum 2025-05-07T19:43:05.7172466Z [INSTALL] Updating system repositories ... 2025-05-07T19:43:05.7198217Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:43:05.9417131Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:43:06.0375761Z Dependencies resolved. 2025-05-07T19:43:06.0585979Z Nothing to do. 2025-05-07T19:43:06.0586730Z Complete! 2025-05-07T19:43:06.1277553Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:43:06.1303738Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:43:06.3446892Z Last metadata expiration check: 0:00:19 ago on Wed May 7 19:42:47 2025. 2025-05-07T19:43:06.3968339Z Dependencies resolved. 
2025-05-07T19:43:06.4134136Z ================================================================================ 2025-05-07T19:43:06.4135304Z Package Arch Version Repository Size 2025-05-07T19:43:06.4135771Z ================================================================================ 2025-05-07T19:43:06.4136116Z Installing: 2025-05-07T19:43:06.4136453Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:43:06.4136946Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:43:06.4137234Z 2025-05-07T19:43:06.4137347Z Transaction Summary 2025-05-07T19:43:06.4137602Z ================================================================================ 2025-05-07T19:43:06.4137938Z Install 2 Packages 2025-05-07T19:43:06.4138078Z 2025-05-07T19:43:06.4138192Z Total download size: 347 k 2025-05-07T19:43:06.4138468Z Installed size: 883 k 2025-05-07T19:43:06.4138727Z Downloading Packages: 2025-05-07T19:43:06.5248624Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.3 MB/s | 28 kB 00:00 2025-05-07T19:43:06.5279292Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 13 MB/s | 319 kB 00:00 2025-05-07T19:43:06.5286152Z -------------------------------------------------------------------------------- 2025-05-07T19:43:06.5287412Z Total 2.9 MB/s | 347 kB 00:00 2025-05-07T19:43:06.5498143Z Running transaction check 2025-05-07T19:43:06.5546899Z Transaction check succeeded. 2025-05-07T19:43:06.5547372Z Running transaction test 2025-05-07T19:43:06.5702927Z Transaction test succeeded. 
2025-05-07T19:43:06.5703268Z Running transaction 2025-05-07T19:43:06.5983119Z Preparing : 1/1 2025-05-07T19:43:06.6057656Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:06.6101485Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.6531328Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.6533625Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:07.6894738Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:07.6895758Z 2025-05-07T19:43:07.6896002Z Installed: 2025-05-07T19:43:07.6897004Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:07.6898441Z 2025-05-07T19:43:07.6898697Z Complete! 2025-05-07T19:43:07.7256620Z + hostname 2025-05-07T19:43:07.7256791Z 2025-05-07T19:43:07.7271399Z bd0f6f446662 2025-05-07T19:43:07.7273898Z 2025-05-07T19:43:07.7274056Z + sudo lshw -C display 2025-05-07T19:43:07.7274229Z 2025-05-07T19:43:07.9238526Z *-display UNCLAIMED 2025-05-07T19:43:07.9239012Z description: VGA compatible controller 2025-05-07T19:43:07.9239400Z product: Amazon.com, Inc. 2025-05-07T19:43:07.9239756Z vendor: Amazon.com, Inc. 2025-05-07T19:43:07.9240050Z physical id: 3 2025-05-07T19:43:07.9240294Z bus info: pci@0000:00:03.0 2025-05-07T19:43:07.9240581Z version: 00 2025-05-07T19:43:07.9240835Z width: 32 bits 2025-05-07T19:43:07.9241065Z clock: 33MHz 2025-05-07T19:43:07.9241335Z capabilities: vga_controller bus_master 2025-05-07T19:43:07.9241664Z configuration: latency=0 2025-05-07T19:43:07.9242014Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:07.9261910Z 2025-05-07T19:43:07.9262213Z ################################################################################ 2025-05-07T19:43:07.9262629Z [INFO] Printing NVIDIA GPU info ... 
2025-05-07T19:43:07.9374613Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:07.9403820Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.9404373Z [CHECK] nvidia-smi not found 2025-05-07T19:43:07.9404675Z ################################################################################ 2025-05-07T19:43:07.9405046Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:07.9510935Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:07.9541263Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.9541787Z [CHECK] rocminfo not found 2025-05-07T19:43:07.9555204Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:07.9556645Z [CHECK] rocm-smi not found 2025-05-07T19:43:07.9635432Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:07.9635951Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:07.9636557Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:07.9636930Z env: 2025-05-07T19:43:07.9637211Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:07.9637564Z BUILD_ENV: build_binary 2025-05-07T19:43:07.9637834Z BUILD_TARGET: default 2025-05-07T19:43:07.9638115Z BUILD_VARIANT: cuda 2025-05-07T19:43:07.9638401Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:07.9638836Z ##[endgroup] 2025-05-07T19:43:08.4319528Z ################################################################################ 2025-05-07T19:43:08.4320419Z # Setup Miniconda 2025-05-07T19:43:08.4320705Z # 2025-05-07T19:43:08.4337448Z # [2025-05-07T19:43:08.433Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:08.4339077Z ################################################################################ 2025-05-07T19:43:08.4340282Z 2025-05-07T19:43:08.4355836Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:08.5202114Z [CHECK] 
Network does not appear to be blocked. 2025-05-07T19:43:08.5203155Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:08.5203446Z 2025-05-07T19:43:08.5220415Z 2025-05-07T19:43:08.5221060Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:08.5244578Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:09.7229203Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:09.7230345Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:09.7231111Z 2025-05-07T19:43:09.7369398Z PREFIX=/github/home/miniconda 2025-05-07T19:43:10.0866834Z Unpacking payload ... 2025-05-07T19:43:10.5661828Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:11.2421855Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:13.1078609Z 2025-05-07T19:43:13.1079232Z Installing base environment... 2025-05-07T19:43:13.1079521Z 2025-05-07T19:43:14.1058137Z Preparing transaction: ...working... done 2025-05-07T19:43:16.9669989Z Executing transaction: ...working... done 2025-05-07T19:43:17.5204279Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:17.5911747Z installation finished. 2025-05-07T19:43:17.5914920Z 2025-05-07T19:43:17.5915866Z + rm -f miniconda.sh 2025-05-07T19:43:17.5916145Z 2025-05-07T19:43:17.6063349Z 2025-05-07T19:43:17.6063692Z [SETUP] Reloading the bash configuration ... 
2025-05-07T19:43:17.6064115Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:17.6064344Z 2025-05-07T19:43:17.9695762Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:17.9697037Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:17.9697575Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:17.9697984Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:17.9698409Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:17.9698876Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:17.9699370Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:17.9699893Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:17.9700482Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:17.9701248Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:17.9702140Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:17.9702578Z modified /github/home/.bashrc 2025-05-07T19:43:17.9702784Z 2025-05-07T19:43:17.9703050Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:17.9703376Z 2025-05-07T19:43:18.0222013Z 2025-05-07T19:43:18.0222482Z + . /github/home/.bashrc 2025-05-07T19:43:18.0222758Z 2025-05-07T19:43:18.8102347Z 2025-05-07T19:43:18.8102982Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 
2025-05-07T19:43:18.8128889Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:30.4545735Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:31.9121933Z Solving environment: \ | / - \ | / - \ | / done 2025-05-07T19:43:32.0015330Z 2025-05-07T19:43:32.0015982Z ## Package Plan ## 2025-05-07T19:43:32.0016489Z 2025-05-07T19:43:32.0016886Z environment location: /github/home/miniconda 2025-05-07T19:43:32.0017595Z 2025-05-07T19:43:32.0017898Z added / updated specs: 2025-05-07T19:43:32.0018664Z - conda-libmamba-solver 2025-05-07T19:43:32.0019438Z - libarchive 2025-05-07T19:43:32.0020032Z - libmamba 2025-05-07T19:43:32.0020619Z - libmambapy 2025-05-07T19:43:32.0020764Z 2025-05-07T19:43:32.0020768Z 2025-05-07T19:43:32.0020905Z The following packages will be downloaded: 2025-05-07T19:43:32.0021170Z 2025-05-07T19:43:32.0021375Z package | build 2025-05-07T19:43:32.0022150Z ---------------------------|----------------- 2025-05-07T19:43:32.0022619Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:32.0023168Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:32.0023641Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:32.0024195Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:32.0024688Z ------------------------------------------------------------ 2025-05-07T19:43:32.0025097Z Total: 1.4 MB 2025-05-07T19:43:32.0025330Z 2025-05-07T19:43:32.0025487Z The following packages will be UPDATED: 2025-05-07T19:43:32.0025710Z 2025-05-07T19:43:32.0031211Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 
2025-05-07T19:43:32.0032151Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:32.0032597Z 2025-05-07T19:43:32.0032867Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:32.0033213Z 2025-05-07T19:43:32.0033558Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:32.0034446Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:32.0034974Z 2025-05-07T19:43:32.0034978Z 2025-05-07T19:43:32.0034982Z 2025-05-07T19:43:32.0035168Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:32.0035565Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:32.0035833Z 2025-05-07T19:43:32.0036285Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:32.0036539Z 2025-05-07T19:43:32.0036543Z 2025-05-07T19:43:32.0036777Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:32.0037083Z 2025-05-07T19:43:32.0037205Z 2025-05-07T19:43:32.0037507Z 2025-05-07T19:43:32.0702146Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:32.0823174Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.0823946Z 2025-05-07T19:43:32.0823961Z 2025-05-07T19:43:32.0928484Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:32.0929368Z 2025-05-07T19:43:32.0929383Z 2025-05-07T19:43:32.1012454Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:32.1013361Z 2025-05-07T19:43:32.1013375Z 2025-05-07T19:43:32.1013386Z 2025-05-07T19:43:32.1161556Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:32.1161927Z 2025-05-07T19:43:32.1162094Z 2025-05-07T19:43:32.1162098Z 2025-05-07T19:43:32.1162375Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:32.1162684Z 2025-05-07T19:43:32.1162688Z 2025-05-07T19:43:32.1162712Z 2025-05-07T19:43:32.1806466Z conda-libmamba-solve | 41 KB | ########## | 100%  
2025-05-07T19:43:32.1807760Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.4124261Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:32.4125043Z 2025-05-07T19:43:32.4160925Z certifi-2025.4.26 | 154 KB | # | 10%  2025-05-07T19:43:32.4161760Z 2025-05-07T19:43:32.4243550Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:32.4244383Z 2025-05-07T19:43:32.4247550Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:32.4248876Z 2025-05-07T19:43:32.4249104Z 2025-05-07T19:43:32.4249325Z  2025-05-07T19:43:32.4249550Z 2025-05-07T19:43:32.4249555Z 2025-05-07T19:43:32.4249745Z  2025-05-07T19:43:32.4250021Z 2025-05-07T19:43:32.4250025Z 2025-05-07T19:43:32.4250288Z 2025-05-07T19:43:32.4250509Z  done 2025-05-07T19:43:32.5259613Z Preparing transaction: \ done 2025-05-07T19:43:32.6268781Z Verifying transaction: / done 2025-05-07T19:43:33.9296744Z Executing transaction: \ | / - \ | / - \ | / - \ done 2025-05-07T19:43:35.5327430Z [SETUP] Updating Miniconda base packages ... 
2025-05-07T19:43:35.5349051Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:36.2640201Z Channels: 2025-05-07T19:43:36.2640632Z - defaults 2025-05-07T19:43:36.2640880Z Platform: linux-64 2025-05-07T19:43:37.3479554Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:37.4762211Z Solving environment: / - Channels: 2025-05-07T19:43:37.4762700Z - defaults 2025-05-07T19:43:37.4762967Z Platform: linux-64 2025-05-07T19:43:37.7578618Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:37.9663280Z Solving environment: / - \ done 2025-05-07T19:43:38.0767904Z | done 2025-05-07T19:43:38.1407009Z 2025-05-07T19:43:38.1407324Z ## Package Plan ## 2025-05-07T19:43:38.1407516Z 2025-05-07T19:43:38.1407674Z environment location: /github/home/miniconda 2025-05-07T19:43:38.1407958Z 2025-05-07T19:43:38.1408071Z added / updated specs: 2025-05-07T19:43:38.1408373Z - conda 2025-05-07T19:43:38.1408507Z 2025-05-07T19:43:38.1408511Z 2025-05-07T19:43:38.1408645Z The following packages will be downloaded: 2025-05-07T19:43:38.1408907Z 2025-05-07T19:43:38.1409033Z package | build 2025-05-07T19:43:38.1409389Z ---------------------------|----------------- 2025-05-07T19:43:38.1409794Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:38.1410249Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:38.1410660Z ------------------------------------------------------------ 2025-05-07T19:43:38.1411074Z Total: 1.4 MB 2025-05-07T19:43:38.1411539Z 2025-05-07T19:43:38.1411665Z The following packages will be UPDATED: 2025-05-07T19:43:38.1411899Z 2025-05-07T19:43:38.1412217Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:38.1412791Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:38.1413052Z 2025-05-07T19:43:38.1413057Z 2025-05-07T19:43:38.1413060Z 2025-05-07T19:43:38.1413205Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:43:38.1413587Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:38.1413812Z 2025-05-07T19:43:38.1790808Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:38.1791331Z 2025-05-07T19:43:38.1927428Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.3780850Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.3781181Z 2025-05-07T19:43:38.3781588Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.3781872Z 2025-05-07T19:43:38.3823593Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:38.3824807Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.3825825Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:38.3826815Z 2025-05-07T19:43:38.3827433Z 2025-05-07T19:43:38.3827998Z  done 2025-05-07T19:43:38.4834984Z Preparing transaction: - done 2025-05-07T19:43:38.5845713Z Verifying transaction: | done 2025-05-07T19:43:40.5891011Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:41.1296665Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:41.1298629Z + conda clean --packages --tarball -y 2025-05-07T19:43:41.1299246Z 2025-05-07T19:43:41.5624919Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:41.5625950Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:41.6176290Z 2025-05-07T19:43:41.6179743Z + conda clean --all -y 2025-05-07T19:43:41.6179946Z 2025-05-07T19:43:42.0651108Z There are no unused tarball(s) to remove. 2025-05-07T19:43:42.0652472Z Will remove 1 index cache(s). 2025-05-07T19:43:42.0652969Z There are no unused package(s) to remove. 2025-05-07T19:43:42.0653306Z There are no tempfile(s) to remove. 2025-05-07T19:43:42.0653623Z There are no logfile(s) to remove. 
2025-05-07T19:43:42.1205742Z 2025-05-07T19:43:42.1206265Z + conda info 2025-05-07T19:43:42.1206466Z 2025-05-07T19:43:42.6800316Z 2025-05-07T19:43:42.6800983Z active environment : base 2025-05-07T19:43:42.6801938Z active env location : /github/home/miniconda 2025-05-07T19:43:42.6802924Z shell level : 1 2025-05-07T19:43:42.6803790Z user config file : /github/home/.condarc 2025-05-07T19:43:42.6804959Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:42.6806120Z conda version : 25.3.1 2025-05-07T19:43:42.6806944Z conda-build version : not installed 2025-05-07T19:43:42.6807861Z python version : 3.13.2.final.0 2025-05-07T19:43:42.6808337Z solver : libmamba (default) 2025-05-07T19:43:42.6808694Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:42.6809021Z __conda=25.3.1=0 2025-05-07T19:43:42.6809341Z __glibc=2.34=0 2025-05-07T19:43:42.6809632Z __linux=6.1.130=0 2025-05-07T19:43:42.6809952Z __unix=0=0 2025-05-07T19:43:42.6810317Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:42.6810717Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:42.6811091Z conda av metadata url : None 2025-05-07T19:43:42.6811473Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:42.6812208Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:42.6812603Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:42.6812997Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:42.6813373Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:42.6813699Z /github/home/.conda/pkgs 2025-05-07T19:43:42.6814049Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:42.6814374Z /github/home/.conda/envs 2025-05-07T19:43:42.6814692Z platform : linux-64 2025-05-07T19:43:42.6815523Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 
aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:42.6816390Z UID:GID : 0:0 2025-05-07T19:43:42.6816662Z netrc file : None 2025-05-07T19:43:42.6816925Z offline mode : False 2025-05-07T19:43:42.6817093Z 2025-05-07T19:43:42.7379365Z 2025-05-07T19:43:42.7379710Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:42.7381215Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_ccbcc1e0-33b5-4f00-a54c-6bb6c3938918 ... 2025-05-07T19:43:42.7381980Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:42.7550278Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.9 2025-05-07T19:43:42.7550781Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.9 2025-05-07T19:43:42.7551482Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:42.7551804Z env: 2025-05-07T19:43:42.7552032Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:42.7552318Z BUILD_ENV: build_binary 2025-05-07T19:43:42.7552559Z BUILD_TARGET: default 2025-05-07T19:43:42.7552774Z BUILD_VARIANT: cuda 2025-05-07T19:43:42.7553005Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:43:42.7553431Z ##[endgroup] 2025-05-07T19:43:43.1976501Z ################################################################################ 2025-05-07T19:43:43.1977183Z # Create Conda Environment 2025-05-07T19:43:43.1977440Z # 2025-05-07T19:43:43.1990729Z # [2025-05-07T19:43:43.198Z] + create_conda_environment build_binary 3.9 2025-05-07T19:43:43.1992179Z ################################################################################ 2025-05-07T19:43:43.1992867Z 2025-05-07T19:43:43.2006529Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:43.2880252Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:43.2881422Z [SETUP] Listing existing Conda environments ... 
2025-05-07T19:43:43.2882371Z + conda info --envs 2025-05-07T19:43:43.2882782Z 2025-05-07T19:43:43.8496473Z 2025-05-07T19:43:43.8497196Z # conda environments: 2025-05-07T19:43:43.8497972Z # 2025-05-07T19:43:43.8498633Z base /github/home/miniconda 2025-05-07T19:43:43.8499355Z 2025-05-07T19:43:43.9080170Z 2025-05-07T19:43:43.9080792Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:45.5014754Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:45.5015617Z 2025-05-07T19:43:45.5025271Z 2025-05-07T19:43:45.5036376Z [SETUP] Creating new Conda environment (Python 3.9) ... 2025-05-07T19:43:45.5058251Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.9 2025-05-07T19:43:46.0714372Z Channels: 2025-05-07T19:43:46.0715067Z - defaults 2025-05-07T19:43:46.0715669Z Platform: linux-64 2025-05-07T19:43:47.4588219Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:47.5595588Z Solving environment: | done 2025-05-07T19:43:47.5887035Z 2025-05-07T19:43:47.5887609Z ## Package Plan ## 2025-05-07T19:43:47.5888156Z 2025-05-07T19:43:47.5888568Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:47.5888888Z 2025-05-07T19:43:47.5889024Z added / updated specs: 2025-05-07T19:43:47.5889351Z - python=3.9 2025-05-07T19:43:47.5889495Z 2025-05-07T19:43:47.5889500Z 2025-05-07T19:43:47.5889660Z The following packages will be downloaded: 2025-05-07T19:43:47.5889890Z 2025-05-07T19:43:47.5890016Z package | build 2025-05-07T19:43:47.5890392Z ---------------------------|----------------- 2025-05-07T19:43:47.5890799Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:47.5891249Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:47.5891713Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:47.5892143Z python-3.9.21 | he870216_1 25.1 MB 2025-05-07T19:43:47.5892591Z setuptools-78.1.1 | py39h06a4308_0 1.7 MB 2025-05-07T19:43:47.5893011Z wheel-0.45.1 | py39h06a4308_0 114 KB 
2025-05-07T19:43:47.5893426Z ------------------------------------------------------------ 2025-05-07T19:43:47.5893791Z Total: 27.1 MB 2025-05-07T19:43:47.5894040Z 2025-05-07T19:43:47.5894183Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:47.5894419Z 2025-05-07T19:43:47.5894685Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:47.5895149Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:47.5896070Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:47.5896660Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:47.5897184Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:47.5897676Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:47.5898138Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:47.5898650Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:47.5899274Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:47.5899746Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:47.5900335Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:47.5900956Z python pkgs/main/linux-64::python-3.9.21-he870216_1 2025-05-07T19:43:47.5901464Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:47.5901973Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py39h06a4308_0 2025-05-07T19:43:47.5902504Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:47.5902971Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:47.5903388Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:47.5903854Z wheel pkgs/main/linux-64::wheel-0.45.1-py39h06a4308_0 2025-05-07T19:43:47.5904273Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:47.5904705Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:47.5904967Z 2025-05-07T19:43:47.5904971Z 
2025-05-07T19:43:47.5904975Z 2025-05-07T19:43:47.5905155Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:47.5905576Z python-3.9.21 | 25.1 MB | | 0% 2025-05-07T19:43:47.5905853Z 2025-05-07T19:43:47.5906199Z setuptools-78.1.1 | 1.7 MB | | 0%  2025-05-07T19:43:47.5906467Z 2025-05-07T19:43:47.5906471Z 2025-05-07T19:43:47.5906832Z ca-certificates-2025 | 129 KB | | 0%  2025-05-07T19:43:47.5907128Z 2025-05-07T19:43:47.5907132Z 2025-05-07T19:43:47.5907135Z 2025-05-07T19:43:47.5921550Z wheel-0.45.1 | 114 KB | | 0%  2025-05-07T19:43:47.5921888Z 2025-05-07T19:43:47.5922137Z 2025-05-07T19:43:47.5922149Z 2025-05-07T19:43:47.5924349Z 2025-05-07T19:43:47.5945488Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:47.5946422Z 2025-05-07T19:43:47.5946438Z 2025-05-07T19:43:47.5946449Z 2025-05-07T19:43:47.5946459Z 2025-05-07T19:43:47.5946470Z 2025-05-07T19:43:47.6239549Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:47.6239875Z 2025-05-07T19:43:47.6239880Z 2025-05-07T19:43:47.6239883Z 2025-05-07T19:43:47.6239887Z 2025-05-07T19:43:47.6382357Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:47.6382709Z 2025-05-07T19:43:47.6382714Z 2025-05-07T19:43:47.6382718Z 2025-05-07T19:43:47.6401575Z wheel-0.45.1 | 114 KB | ########## | 100%  2025-05-07T19:43:47.6402189Z 2025-05-07T19:43:47.6402194Z 2025-05-07T19:43:47.6458584Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:47.6459476Z 2025-05-07T19:43:47.6459490Z 2025-05-07T19:43:47.6459501Z 2025-05-07T19:43:47.6459512Z 2025-05-07T19:43:47.6459522Z 2025-05-07T19:43:47.6623908Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:47.6624843Z 2025-05-07T19:43:47.6624856Z 2025-05-07T19:43:47.6722593Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:47.6722935Z 2025-05-07T19:43:47.6723188Z 2025-05-07T19:43:47.6723198Z 2025-05-07T19:43:47.6723203Z 2025-05-07T19:43:47.6723208Z 2025-05-07T19:43:47.6747690Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  
2025-05-07T19:43:47.6749095Z 2025-05-07T19:43:47.6811214Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:47.6812052Z 2025-05-07T19:43:47.6812066Z 2025-05-07T19:43:47.6812077Z 2025-05-07T19:43:47.6812087Z 2025-05-07T19:43:47.6888845Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:47.7038312Z python-3.9.21 | 25.1 MB | ##5 | 25% 2025-05-07T19:43:47.7039153Z 2025-05-07T19:43:47.7039167Z 2025-05-07T19:43:47.7039178Z 2025-05-07T19:43:47.7039978Z wheel-0.45.1 | 114 KB | ########## | 100%  2025-05-07T19:43:47.7041216Z 2025-05-07T19:43:47.7041229Z 2025-05-07T19:43:47.7041240Z 2025-05-07T19:43:47.8935216Z wheel-0.45.1 | 114 KB | ########## | 100%  2025-05-07T19:43:47.8936416Z python-3.9.21 | 25.1 MB | ########## | 100% 2025-05-07T19:43:47.9228663Z python-3.9.21 | 25.1 MB | ########## | 100% 2025-05-07T19:43:47.9229470Z 2025-05-07T19:43:47.9230310Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:47.9231094Z 2025-05-07T19:43:48.3993553Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:48.3994633Z python-3.9.21 | 25.1 MB | ########## | 100% 2025-05-07T19:43:48.3995024Z 2025-05-07T19:43:48.3995411Z 2025-05-07T19:43:48.3995624Z  2025-05-07T19:43:48.3995842Z 2025-05-07T19:43:48.3995848Z 2025-05-07T19:43:48.3996039Z  2025-05-07T19:43:48.3996319Z 2025-05-07T19:43:48.3996325Z 2025-05-07T19:43:48.3996329Z 2025-05-07T19:43:48.3996509Z  2025-05-07T19:43:48.3996735Z 2025-05-07T19:43:48.3996739Z 2025-05-07T19:43:48.3996743Z 2025-05-07T19:43:48.3996746Z 2025-05-07T19:43:48.3996950Z  2025-05-07T19:43:48.3997175Z 2025-05-07T19:43:48.3997207Z 2025-05-07T19:43:48.3997210Z 2025-05-07T19:43:48.3997214Z 2025-05-07T19:43:48.3997217Z 2025-05-07T19:43:48.3997441Z  done 2025-05-07T19:43:48.6110070Z Preparing transaction: - \ done 2025-05-07T19:43:49.7568793Z Verifying transaction: / - \ | / - \ | / - \ done 2025-05-07T19:43:51.8717276Z Executing transaction: / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:51.8755429Z 
# 2025-05-07T19:43:51.8756109Z # To activate this environment, use 2025-05-07T19:43:51.8757032Z # 2025-05-07T19:43:51.8757903Z # $ conda activate build_binary 2025-05-07T19:43:51.8758658Z # 2025-05-07T19:43:51.8759393Z # To deactivate an active environment, use 2025-05-07T19:43:51.8760219Z # 2025-05-07T19:43:51.8760752Z # $ conda deactivate 2025-05-07T19:43:51.8761207Z 2025-05-07T19:43:51.9621154Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:51.9655009Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:54.6827228Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:54.6829196Z 2025-05-07T19:43:54.6829630Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (25.1) 2025-05-07T19:43:54.6830285Z Collecting pip 2025-05-07T19:43:54.6830616Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:54.6831038Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:54.6831917Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 122.4 MB/s eta 0:00:00 2025-05-07T19:43:54.6832319Z Installing collected packages: pip 2025-05-07T19:43:54.6834100Z Attempting uninstall: pip 2025-05-07T19:43:54.6834550Z Found existing installation: pip 25.1 2025-05-07T19:43:54.6834930Z Uninstalling pip-25.1: 2025-05-07T19:43:54.6835265Z Successfully uninstalled pip-25.1 2025-05-07T19:43:54.6835613Z Successfully installed pip-25.1.1 2025-05-07T19:43:54.6835824Z 2025-05-07T19:43:54.7413081Z [SETUP] Upgrading pyOpenSSL ... 
2025-05-07T19:43:54.7442342Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:55.4026852Z Channels: 2025-05-07T19:43:55.4027688Z - conda-forge 2025-05-07T19:43:55.4027985Z Platform: linux-64 2025-05-07T19:44:05.1253233Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:06.8904677Z Solving environment: | / - \ | done 2025-05-07T19:44:06.9353831Z 2025-05-07T19:44:06.9354318Z ## Package Plan ## 2025-05-07T19:44:06.9354542Z 2025-05-07T19:44:06.9354837Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:06.9355212Z 2025-05-07T19:44:06.9355327Z added / updated specs: 2025-05-07T19:44:06.9355662Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:06.9355868Z 2025-05-07T19:44:06.9355873Z 2025-05-07T19:44:06.9356009Z The following packages will be downloaded: 2025-05-07T19:44:06.9356277Z 2025-05-07T19:44:06.9356408Z package | build 2025-05-07T19:44:06.9356763Z ---------------------------|----------------- 2025-05-07T19:44:06.9357187Z cffi-1.17.1 | py39h15c3d72_0 236 KB conda-forge 2025-05-07T19:44:06.9357726Z cryptography-44.0.3 | py39h7170ec2_0 1.5 MB conda-forge 2025-05-07T19:44:06.9358207Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:06.9358711Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:06.9359174Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:06.9359650Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:06.9360144Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:06.9360630Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:06.9361128Z python_abi-3.9 | 2_cp39 4 KB conda-forge 2025-05-07T19:44:06.9361626Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:06.9362206Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:06.9362682Z 
------------------------------------------------------------ 2025-05-07T19:44:06.9363094Z Total: 6.3 MB 2025-05-07T19:44:06.9363330Z 2025-05-07T19:44:06.9363500Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:06.9363743Z 2025-05-07T19:44:06.9363987Z cffi conda-forge/linux-64::cffi-1.17.1-py39h15c3d72_0 2025-05-07T19:44:06.9364542Z cryptography conda-forge/linux-64::cryptography-44.0.3-py39h7170ec2_0 2025-05-07T19:44:06.9365074Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:06.9365596Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:06.9366144Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:06.9366647Z python_abi conda-forge/linux-64::python_abi-3.9-2_cp39 2025-05-07T19:44:06.9367240Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:06.9367872Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:06.9368273Z 2025-05-07T19:44:06.9368403Z The following packages will be UPDATED: 2025-05-07T19:44:06.9368629Z 2025-05-07T19:44:06.9371740Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:06.9372599Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:06.9373344Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:06.9374092Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:06.9374495Z 2025-05-07T19:44:06.9374499Z 2025-05-07T19:44:06.9374503Z 2025-05-07T19:44:06.9374821Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:06.9375220Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:06.9375458Z 2025-05-07T19:44:06.9376132Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:06.9376395Z 2025-05-07T19:44:06.9376399Z 2025-05-07T19:44:06.9376612Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:06.9376862Z 2025-05-07T19:44:06.9376888Z 2025-05-07T19:44:06.9376892Z 2025-05-07T19:44:06.9397209Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:06.9397507Z 2025-05-07T19:44:06.9397512Z 2025-05-07T19:44:06.9397612Z 2025-05-07T19:44:06.9397622Z 2025-05-07T19:44:06.9419547Z cffi-1.17.1 | 236 KB | | 0%  2025-05-07T19:44:06.9419979Z 2025-05-07T19:44:06.9420250Z 2025-05-07T19:44:06.9420254Z 2025-05-07T19:44:06.9420345Z 2025-05-07T19:44:06.9420374Z 2025-05-07T19:44:06.9422328Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:06.9422665Z 2025-05-07T19:44:06.9422669Z 2025-05-07T19:44:06.9422673Z 2025-05-07T19:44:06.9422677Z 2025-05-07T19:44:06.9422681Z 2025-05-07T19:44:06.9422699Z 2025-05-07T19:44:06.9422981Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:06.9423263Z 2025-05-07T19:44:06.9423268Z 2025-05-07T19:44:06.9423271Z 2025-05-07T19:44:06.9423275Z 2025-05-07T19:44:06.9423286Z 2025-05-07T19:44:06.9423290Z 2025-05-07T19:44:06.9430723Z 2025-05-07T19:44:06.9431915Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:06.9432234Z 2025-05-07T19:44:06.9432237Z 2025-05-07T19:44:06.9432241Z 2025-05-07T19:44:06.9432244Z 2025-05-07T19:44:06.9432247Z 2025-05-07T19:44:06.9432251Z 2025-05-07T19:44:06.9432255Z 2025-05-07T19:44:06.9432259Z 2025-05-07T19:44:06.9433089Z typing_extensions-4. 
| 51 KB | | 0%  2025-05-07T19:44:06.9433401Z 2025-05-07T19:44:06.9433413Z 2025-05-07T19:44:06.9433416Z 2025-05-07T19:44:06.9433420Z 2025-05-07T19:44:06.9433423Z 2025-05-07T19:44:06.9433427Z 2025-05-07T19:44:06.9433434Z 2025-05-07T19:44:06.9433438Z 2025-05-07T19:44:06.9433457Z 2025-05-07T19:44:06.9434300Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:06.9434601Z 2025-05-07T19:44:06.9434606Z 2025-05-07T19:44:06.9434609Z 2025-05-07T19:44:06.9434623Z 2025-05-07T19:44:06.9434646Z 2025-05-07T19:44:06.9434650Z 2025-05-07T19:44:06.9434653Z 2025-05-07T19:44:06.9434656Z 2025-05-07T19:44:06.9434660Z 2025-05-07T19:44:06.9434664Z 2025-05-07T19:44:07.0053418Z python_abi-3.9 | 4 KB | | 0%  2025-05-07T19:44:07.0054321Z 2025-05-07T19:44:07.0262494Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.0262807Z 2025-05-07T19:44:07.0262812Z 2025-05-07T19:44:07.0300885Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.0301259Z 2025-05-07T19:44:07.0301572Z 2025-05-07T19:44:07.0301580Z 2025-05-07T19:44:07.0301585Z 2025-05-07T19:44:07.0355794Z cffi-1.17.1 | 236 KB | ########## | 100%  2025-05-07T19:44:07.0438195Z openssl-3.5.0 | 3.0 MB | ###5 | 35% 2025-05-07T19:44:07.0438688Z 2025-05-07T19:44:07.0438735Z 2025-05-07T19:44:07.0438741Z 2025-05-07T19:44:07.0438835Z 2025-05-07T19:44:07.0438843Z 2025-05-07T19:44:07.0439564Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.0439885Z 2025-05-07T19:44:07.0439896Z 2025-05-07T19:44:07.0439900Z 2025-05-07T19:44:07.0439904Z 2025-05-07T19:44:07.0439921Z 2025-05-07T19:44:07.0476508Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.0476838Z 2025-05-07T19:44:07.0476882Z 2025-05-07T19:44:07.0476886Z 2025-05-07T19:44:07.0728225Z libgomp-15.1.0 | 442 KB | 3 | 4%  2025-05-07T19:44:07.0728518Z 2025-05-07T19:44:07.0728616Z 2025-05-07T19:44:07.0728825Z 2025-05-07T19:44:07.0760269Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.0760582Z 2025-05-07T19:44:07.0760586Z 
2025-05-07T19:44:07.0760590Z 2025-05-07T19:44:07.0760594Z 2025-05-07T19:44:07.0760598Z 2025-05-07T19:44:07.0760601Z 2025-05-07T19:44:07.0780993Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:07.0781347Z 2025-05-07T19:44:07.0781351Z 2025-05-07T19:44:07.0781372Z 2025-05-07T19:44:07.0781376Z 2025-05-07T19:44:07.0781380Z 2025-05-07T19:44:07.0781383Z 2025-05-07T19:44:07.0781387Z 2025-05-07T19:44:07.0781391Z 2025-05-07T19:44:07.0803410Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:07.0803767Z 2025-05-07T19:44:07.0803772Z 2025-05-07T19:44:07.0803776Z 2025-05-07T19:44:07.0803779Z 2025-05-07T19:44:07.0803783Z 2025-05-07T19:44:07.0803787Z 2025-05-07T19:44:07.0815284Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.0815608Z 2025-05-07T19:44:07.0815612Z 2025-05-07T19:44:07.0815616Z 2025-05-07T19:44:07.0815620Z 2025-05-07T19:44:07.0815631Z 2025-05-07T19:44:07.0815634Z 2025-05-07T19:44:07.0815638Z 2025-05-07T19:44:07.0815641Z 2025-05-07T19:44:07.1005640Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:07.1006026Z 2025-05-07T19:44:07.1006030Z 2025-05-07T19:44:07.1006034Z 2025-05-07T19:44:07.1006055Z 2025-05-07T19:44:07.1006277Z cffi-1.17.1 | 236 KB | ########## | 100%  2025-05-07T19:44:07.1006527Z 2025-05-07T19:44:07.1006531Z 2025-05-07T19:44:07.1006535Z 2025-05-07T19:44:07.1006538Z 2025-05-07T19:44:07.1006554Z 2025-05-07T19:44:07.1006559Z 2025-05-07T19:44:07.1006563Z 2025-05-07T19:44:07.1006842Z typing-extensions-4. 
| 88 KB | #8 | 18%  2025-05-07T19:44:07.1007148Z 2025-05-07T19:44:07.1007152Z 2025-05-07T19:44:07.1007155Z 2025-05-07T19:44:07.1007159Z 2025-05-07T19:44:07.1021009Z cffi-1.17.1 | 236 KB | ########## | 100%  2025-05-07T19:44:07.1029587Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.1029876Z 2025-05-07T19:44:07.1029888Z 2025-05-07T19:44:07.1029893Z 2025-05-07T19:44:07.1029897Z 2025-05-07T19:44:07.1029902Z 2025-05-07T19:44:07.1029906Z 2025-05-07T19:44:07.1030172Z 2025-05-07T19:44:07.1117934Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:07.1118305Z 2025-05-07T19:44:07.1118430Z 2025-05-07T19:44:07.1118434Z 2025-05-07T19:44:07.1118460Z 2025-05-07T19:44:07.1118819Z 2025-05-07T19:44:07.1118847Z 2025-05-07T19:44:07.1118860Z 2025-05-07T19:44:07.1118878Z 2025-05-07T19:44:07.1118891Z 2025-05-07T19:44:07.1137635Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:07.1138013Z 2025-05-07T19:44:07.1138020Z 2025-05-07T19:44:07.1138054Z 2025-05-07T19:44:07.1138061Z 2025-05-07T19:44:07.1138068Z 2025-05-07T19:44:07.1138075Z 2025-05-07T19:44:07.1138121Z 2025-05-07T19:44:07.1138127Z 2025-05-07T19:44:07.1138132Z 2025-05-07T19:44:07.1189377Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.1190272Z 2025-05-07T19:44:07.1190747Z 2025-05-07T19:44:07.1191733Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.1192036Z 2025-05-07T19:44:07.1192042Z 2025-05-07T19:44:07.1273581Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:07.1273922Z 2025-05-07T19:44:07.1273927Z 2025-05-07T19:44:07.1273930Z 2025-05-07T19:44:07.1273934Z 2025-05-07T19:44:07.1273937Z 2025-05-07T19:44:07.1273941Z 2025-05-07T19:44:07.1273944Z 2025-05-07T19:44:07.1273972Z 2025-05-07T19:44:07.1273976Z 2025-05-07T19:44:07.1273979Z 2025-05-07T19:44:07.1279192Z python_abi-3.9 | 4 KB | ########## | 100%  2025-05-07T19:44:07.1279499Z 2025-05-07T19:44:07.1279503Z 2025-05-07T19:44:07.1279506Z 2025-05-07T19:44:07.1279667Z 2025-05-07T19:44:07.1279670Z 
2025-05-07T19:44:07.1279699Z 2025-05-07T19:44:07.1279703Z 2025-05-07T19:44:07.1279706Z 2025-05-07T19:44:07.1279710Z 2025-05-07T19:44:07.1279720Z 2025-05-07T19:44:07.1282588Z python_abi-3.9 | 4 KB | ########## | 100%  2025-05-07T19:44:07.1282878Z 2025-05-07T19:44:07.1282890Z 2025-05-07T19:44:07.1282918Z 2025-05-07T19:44:07.1282921Z 2025-05-07T19:44:07.1282925Z 2025-05-07T19:44:07.1633330Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:07.1633643Z 2025-05-07T19:44:07.1633647Z 2025-05-07T19:44:07.1633661Z 2025-05-07T19:44:07.1635686Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.1635956Z 2025-05-07T19:44:07.1635960Z 2025-05-07T19:44:07.1635972Z 2025-05-07T19:44:07.1772962Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:07.1773294Z 2025-05-07T19:44:07.1773299Z 2025-05-07T19:44:07.1773303Z 2025-05-07T19:44:07.1773320Z 2025-05-07T19:44:07.1773323Z 2025-05-07T19:44:07.1773327Z 2025-05-07T19:44:07.1773330Z 2025-05-07T19:44:07.1773334Z 2025-05-07T19:44:07.2099456Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:07.2100693Z 2025-05-07T19:44:07.2100744Z 2025-05-07T19:44:07.2100755Z 2025-05-07T19:44:07.2100766Z 2025-05-07T19:44:07.2100776Z 2025-05-07T19:44:07.2100787Z 2025-05-07T19:44:07.2101589Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.2102582Z 2025-05-07T19:44:07.2102585Z 2025-05-07T19:44:07.2102589Z 2025-05-07T19:44:07.2102592Z 2025-05-07T19:44:07.2102596Z 2025-05-07T19:44:07.2102599Z 2025-05-07T19:44:07.2168417Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:07.2169371Z 2025-05-07T19:44:07.2169384Z 2025-05-07T19:44:07.2169395Z 2025-05-07T19:44:07.2169407Z 2025-05-07T19:44:07.2169417Z 2025-05-07T19:44:07.2169427Z 2025-05-07T19:44:07.2169438Z 2025-05-07T19:44:07.2170315Z typing-extensions-4. 
| 88 KB | ########## | 100%  2025-05-07T19:44:07.2171232Z 2025-05-07T19:44:07.2171276Z 2025-05-07T19:44:07.2171287Z 2025-05-07T19:44:07.2171297Z 2025-05-07T19:44:07.2171307Z 2025-05-07T19:44:07.2171317Z 2025-05-07T19:44:07.2171328Z 2025-05-07T19:44:07.2424277Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:07.2425295Z 2025-05-07T19:44:07.2425377Z 2025-05-07T19:44:07.2425390Z 2025-05-07T19:44:07.2425401Z 2025-05-07T19:44:07.2425412Z 2025-05-07T19:44:07.2425422Z 2025-05-07T19:44:07.2425433Z 2025-05-07T19:44:07.2425444Z 2025-05-07T19:44:07.2425454Z 2025-05-07T19:44:07.2426223Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.2427065Z 2025-05-07T19:44:07.2427076Z 2025-05-07T19:44:07.2427118Z 2025-05-07T19:44:07.2427128Z 2025-05-07T19:44:07.2427139Z 2025-05-07T19:44:07.2427149Z 2025-05-07T19:44:07.2427159Z 2025-05-07T19:44:07.2427169Z 2025-05-07T19:44:07.2427210Z 2025-05-07T19:44:07.2523497Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:07.2524468Z 2025-05-07T19:44:07.2524481Z 2025-05-07T19:44:07.2524491Z 2025-05-07T19:44:07.2524502Z 2025-05-07T19:44:07.2524512Z 2025-05-07T19:44:07.2524522Z 2025-05-07T19:44:07.2524532Z 2025-05-07T19:44:07.2524543Z 2025-05-07T19:44:07.2524553Z 2025-05-07T19:44:07.2524563Z 2025-05-07T19:44:07.2581425Z python_abi-3.9 | 4 KB | ########## | 100%  2025-05-07T19:44:07.2582395Z 2025-05-07T19:44:07.2582870Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.2583148Z 2025-05-07T19:44:07.2867492Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:07.2868728Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.2874042Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:07.2875065Z 2025-05-07T19:44:07.2876446Z 2025-05-07T19:44:07.2877005Z  2025-05-07T19:44:07.2877631Z 2025-05-07T19:44:07.2877643Z 2025-05-07T19:44:07.2878172Z  2025-05-07T19:44:07.2878938Z 2025-05-07T19:44:07.2878943Z 2025-05-07T19:44:07.2878946Z 2025-05-07T19:44:07.2879135Z  
2025-05-07T19:44:07.2879394Z 2025-05-07T19:44:07.2879398Z 2025-05-07T19:44:07.2879401Z 2025-05-07T19:44:07.2879405Z 2025-05-07T19:44:07.2879592Z  2025-05-07T19:44:07.2879826Z 2025-05-07T19:44:07.2879830Z 2025-05-07T19:44:07.2879834Z 2025-05-07T19:44:07.2879863Z 2025-05-07T19:44:07.2879866Z 2025-05-07T19:44:07.2880056Z  2025-05-07T19:44:07.2880290Z 2025-05-07T19:44:07.2880293Z 2025-05-07T19:44:07.2880296Z 2025-05-07T19:44:07.2880304Z 2025-05-07T19:44:07.2880308Z 2025-05-07T19:44:07.2880312Z 2025-05-07T19:44:07.2880546Z  2025-05-07T19:44:07.2880784Z 2025-05-07T19:44:07.2880788Z 2025-05-07T19:44:07.2880791Z 2025-05-07T19:44:07.2880795Z 2025-05-07T19:44:07.2880798Z 2025-05-07T19:44:07.2880801Z 2025-05-07T19:44:07.2880805Z 2025-05-07T19:44:07.2881026Z  2025-05-07T19:44:07.2881264Z 2025-05-07T19:44:07.2881267Z 2025-05-07T19:44:07.2881271Z 2025-05-07T19:44:07.2881274Z 2025-05-07T19:44:07.2881278Z 2025-05-07T19:44:07.2881281Z 2025-05-07T19:44:07.2881284Z 2025-05-07T19:44:07.2881288Z 2025-05-07T19:44:07.2881516Z  2025-05-07T19:44:07.2881756Z 2025-05-07T19:44:07.2881759Z 2025-05-07T19:44:07.2881762Z 2025-05-07T19:44:07.2881766Z 2025-05-07T19:44:07.2881769Z 2025-05-07T19:44:07.2881773Z 2025-05-07T19:44:07.2881780Z 2025-05-07T19:44:07.2881784Z 2025-05-07T19:44:07.2881787Z 2025-05-07T19:44:07.2882014Z  2025-05-07T19:44:07.2882255Z 2025-05-07T19:44:07.2882258Z 2025-05-07T19:44:07.2882261Z 2025-05-07T19:44:07.2882265Z 2025-05-07T19:44:07.2882268Z 2025-05-07T19:44:07.2882272Z 2025-05-07T19:44:07.2882275Z 2025-05-07T19:44:07.2882279Z 2025-05-07T19:44:07.2882287Z 2025-05-07T19:44:07.2882290Z 2025-05-07T19:44:07.2882614Z  done 2025-05-07T19:44:07.3885828Z Preparing transaction: - done 2025-05-07T19:44:07.4895672Z Verifying transaction: | done 2025-05-07T19:44:08.8922507Z Executing transaction: - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:08.9927849Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:10.6880716Z [CHECK] Python (sub-)package 'OpenSSL' found ... 
2025-05-07T19:44:10.6888303Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:10.6914832Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:11.3522791Z Channels: 2025-05-07T19:44:11.3523476Z - conda-forge 2025-05-07T19:44:11.3524109Z Platform: linux-64 2025-05-07T19:44:14.4547588Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:14.8818457Z Solving environment: \ done 2025-05-07T19:44:14.9295451Z 2025-05-07T19:44:14.9296139Z ## Package Plan ## 2025-05-07T19:44:14.9296644Z 2025-05-07T19:44:14.9297226Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:14.9298154Z 2025-05-07T19:44:14.9298422Z added / updated specs: 2025-05-07T19:44:14.9299120Z - libxcrypt 2025-05-07T19:44:14.9299513Z 2025-05-07T19:44:14.9299526Z 2025-05-07T19:44:14.9299871Z The following packages will be downloaded: 2025-05-07T19:44:14.9300763Z 2025-05-07T19:44:14.9301121Z package | build 2025-05-07T19:44:14.9302376Z ---------------------------|----------------- 2025-05-07T19:44:14.9303519Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:14.9304736Z ------------------------------------------------------------ 2025-05-07T19:44:14.9305795Z Total: 98 KB 2025-05-07T19:44:14.9306019Z 2025-05-07T19:44:14.9306166Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:14.9306417Z 2025-05-07T19:44:14.9306662Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:14.9307065Z 2025-05-07T19:44:14.9307069Z 2025-05-07T19:44:14.9307090Z 2025-05-07T19:44:14.9307225Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:15.0543855Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:15.0565428Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:15.0662190Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:15.0662663Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:15.0663069Z 2025-05-07T19:44:15.0663414Z done 2025-05-07T19:44:15.1672658Z Preparing transaction: / done 2025-05-07T19:44:15.2681414Z Verifying transaction: \ done 2025-05-07T19:44:15.3690907Z Executing transaction: / done 2025-05-07T19:44:18.6516622Z [SETUP] Copying over ... 2025-05-07T19:44:18.6518770Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.9/crypt.h 2025-05-07T19:44:18.6520562Z 2025-05-07T19:44:18.6543967Z 2025-05-07T19:44:20.2469542Z [SETUP] Installed Python version: Python 3.9.21 2025-05-07T19:44:20.2470834Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:20.2539824Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:20.2540409Z . 
$PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:20.2541169Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:20.2541544Z env: 2025-05-07T19:44:20.2541770Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:20.2542098Z BUILD_ENV: build_binary 2025-05-07T19:44:20.2542343Z BUILD_TARGET: default 2025-05-07T19:44:20.2542591Z BUILD_VARIANT: cuda 2025-05-07T19:44:20.2542822Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:44:20.2543085Z ##[endgroup] 2025-05-07T19:44:20.6604744Z ################################################################################ 2025-05-07T19:44:20.6605144Z # Install C/C++ Compilers 2025-05-07T19:44:20.6605417Z # 2025-05-07T19:44:20.6629311Z # [2025-05-07T19:44:20.662Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:20.6630723Z ################################################################################ 2025-05-07T19:44:20.6631475Z 2025-05-07T19:44:20.6646341Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:20.7467105Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:20.7472613Z [INSTALL] Installing GLIBC (architecture = 64) ... 
2025-05-07T19:44:20.7497954Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:21.4101095Z Channels: 2025-05-07T19:44:21.4101715Z - conda-forge 2025-05-07T19:44:21.4102000Z Platform: linux-64 2025-05-07T19:44:24.5689505Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:24.9947505Z Solving environment: \ done 2025-05-07T19:44:25.0427416Z 2025-05-07T19:44:25.0428294Z ## Package Plan ## 2025-05-07T19:44:25.0428529Z 2025-05-07T19:44:25.0428772Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:25.0429115Z 2025-05-07T19:44:25.0429230Z added / updated specs: 2025-05-07T19:44:25.0429580Z - sysroot_linux-64=2.17 2025-05-07T19:44:25.0429767Z 2025-05-07T19:44:25.0429771Z 2025-05-07T19:44:25.0429919Z The following packages will be downloaded: 2025-05-07T19:44:25.0430547Z 2025-05-07T19:44:25.0430684Z package | build 2025-05-07T19:44:25.0431078Z ---------------------------|----------------- 2025-05-07T19:44:25.0431537Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:25.0432100Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:25.0432559Z ------------------------------------------------------------ 2025-05-07T19:44:25.0432961Z Total: 15.4 MB 2025-05-07T19:44:25.0433191Z 2025-05-07T19:44:25.0433357Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:25.0433600Z 2025-05-07T19:44:25.0433911Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:25.0434559Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:25.0434896Z 2025-05-07T19:44:25.0434900Z 2025-05-07T19:44:25.0434903Z 2025-05-07T19:44:25.0435064Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:25.0435510Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:25.0435762Z 2025-05-07T19:44:25.2497744Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:25.2669734Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:25.2670553Z 2025-05-07T19:44:25.2761305Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:25.2761656Z 2025-05-07T19:44:25.3498563Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.4377686Z sysroot_linux-64-2.1 | 14.5 MB | #######3 | 74% 2025-05-07T19:44:25.5134166Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:25.5134515Z 2025-05-07T19:44:25.5135220Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.5135506Z 2025-05-07T19:44:25.9210316Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:25.9211582Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:25.9212620Z 2025-05-07T19:44:25.9213247Z 2025-05-07T19:44:25.9213775Z  done 2025-05-07T19:44:26.0222545Z Preparing transaction: / done 2025-05-07T19:44:26.2236640Z Verifying transaction: \ | done 2025-05-07T19:44:26.3248901Z Executing transaction: - done 2025-05-07T19:44:26.4103237Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:26.4104113Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:28.0351677Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:28.0359854Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 
2025-05-07T19:44:28.0383256Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:28.7213694Z Channels: 2025-05-07T19:44:28.7214101Z - conda-forge 2025-05-07T19:44:28.7215249Z Platform: linux-64 2025-05-07T19:44:31.8728006Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:33.0136564Z Solving environment: \ | / done 2025-05-07T19:44:33.0657771Z 2025-05-07T19:44:33.0658412Z ## Package Plan ## 2025-05-07T19:44:33.0658877Z 2025-05-07T19:44:33.0659465Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:33.0660621Z 2025-05-07T19:44:33.0660895Z added / updated specs: 2025-05-07T19:44:33.0661651Z - gxx_linux-64=11.4.0 2025-05-07T19:44:33.0662113Z 2025-05-07T19:44:33.0662125Z 2025-05-07T19:44:33.0662472Z The following packages will be downloaded: 2025-05-07T19:44:33.0663144Z 2025-05-07T19:44:33.0663474Z package | build 2025-05-07T19:44:33.0664449Z ---------------------------|----------------- 2025-05-07T19:44:33.0665682Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:33.0667161Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:33.0668129Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:33.0668610Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:33.0669168Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:33.0669618Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:33.0670042Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:33.0670521Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:33.0670999Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:33.0671435Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:33.0671915Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:33.0672387Z 
libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:33.0672797Z ------------------------------------------------------------ 2025-05-07T19:44:33.0673134Z Total: 91.6 MB 2025-05-07T19:44:33.0673411Z 2025-05-07T19:44:33.0673545Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:33.0673771Z 2025-05-07T19:44:33.0674095Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:33.0674668Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:33.0675249Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:33.0676352Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:33.0676980Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:33.0677571Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:33.0678159Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.0678800Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:33.0679342Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:33.0679969Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:33.0680367Z 2025-05-07T19:44:33.0680531Z The following packages will be UPDATED: 2025-05-07T19:44:33.0680762Z 2025-05-07T19:44:33.0681101Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:33.0681917Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:33.0682591Z 2025-05-07T19:44:33.0682595Z 2025-05-07T19:44:33.0682598Z 2025-05-07T19:44:33.0682783Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:44:33.4207953Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100% 2025-05-07T19:44:33.5770913Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100% 2025-05-07T19:44:33.6781264Z binutils_impl_linux- | 6.0 MB | ########## | 100% 2025-05-07T19:44:33.7149469Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100% 2025-05-07T19:44:33.7257614Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100% 2025-05-07T19:44:33.7326938Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100% 2025-05-07T19:44:33.7719884Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100% 2025-05-07T19:44:33.7836272Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100% 2025-05-07T19:44:33.7919100Z binutils_linux-64-2. | 28 KB | ########## | 100% 2025-05-07T19:44:33.7919808Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100% 2025-05-07T19:44:33.8306121Z libstdcxx-devel_linu | 11.1 MB | ########## | 100% 2025-05-07T19:44:34.4577085Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:34.9367664Z done 2025-05-07T19:44:35.0372429Z Preparing transaction: \ done 2025-05-07T19:44:35.3422046Z Verifying transaction: / - \ done 2025-05-07T19:44:35.4437004Z Executing transaction: / done 2025-05-07T19:44:35.5371513Z [INSTALL] Setting the C/C++ compiler symlinks ...
2025-05-07T19:44:39.2197053Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:39.2208658Z 2025-05-07T19:44:39.2208665Z 2025-05-07T19:44:39.2224021Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:39.2224619Z 2025-05-07T19:44:39.2238565Z 2025-05-07T19:44:39.2266543Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:39.2267135Z 2025-05-07T19:44:39.2287761Z 2025-05-07T19:44:39.2305809Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:39.2306411Z 2025-05-07T19:44:39.2321111Z 2025-05-07T19:44:39.2336297Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ... 2025-05-07T19:44:39.2361384Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6 2025-05-07T19:44:39.9462280Z Channels: 2025-05-07T19:44:39.9462964Z - conda-forge 2025-05-07T19:44:39.9464089Z Platform: linux-64 2025-05-07T19:44:43.0319369Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:44.3759821Z Solving environment: \ | / - done 2025-05-07T19:44:44.4270996Z 2025-05-07T19:44:44.4271656Z ## Package Plan ## 2025-05-07T19:44:44.4272117Z 2025-05-07T19:44:44.4272724Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:44.4273629Z 2025-05-07T19:44:44.4273896Z added / updated specs: 2025-05-07T19:44:44.4274615Z - clangxx=16.0.6 2025-05-07T19:44:44.4275273Z - compiler-rt=16.0.6 2025-05-07T19:44:44.4276359Z - libcxx 2025-05-07T19:44:44.4276951Z - llvm-openmp=16.0.6 2025-05-07T19:44:44.4277426Z 2025-05-07T19:44:44.4277439Z 2025-05-07T19:44:44.4277786Z The following 
packages will be downloaded: 2025-05-07T19:44:44.4278356Z 2025-05-07T19:44:44.4278496Z package | build 2025-05-07T19:44:44.4278832Z ---------------------------|----------------- 2025-05-07T19:44:44.4279252Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge 2025-05-07T19:44:44.4279718Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge 2025-05-07T19:44:44.4280200Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge 2025-05-07T19:44:44.4280683Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge 2025-05-07T19:44:44.4281172Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge 2025-05-07T19:44:44.4281641Z icu-73.2 | h59595ed_0 11.5 MB conda-forge 2025-05-07T19:44:44.4282103Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge 2025-05-07T19:44:44.4282876Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge 2025-05-07T19:44:44.4283321Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge 2025-05-07T19:44:44.4283780Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:44:44.4284386Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge 2025-05-07T19:44:44.4284919Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge 2025-05-07T19:44:44.4285335Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge 2025-05-07T19:44:44.4285749Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge 2025-05-07T19:44:44.4286169Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge 2025-05-07T19:44:44.4286722Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge 2025-05-07T19:44:44.4287190Z ------------------------------------------------------------ 2025-05-07T19:44:44.4287573Z Total: 142.6 MB 2025-05-07T19:44:44.4287800Z 2025-05-07T19:44:44.4287935Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:44.4288186Z 2025-05-07T19:44:44.4288423Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14 2025-05-07T19:44:44.4288982Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:44.4289513Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14 
2025-05-07T19:44:44.4290056Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2 2025-05-07T19:44:44.4290617Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2 2025-05-07T19:44:44.4291144Z icu conda-forge/linux-64::icu-73.2-h59595ed_0 2025-05-07T19:44:44.4291678Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:44.4292230Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1 2025-05-07T19:44:44.4292714Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1 2025-05-07T19:44:44.4293194Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:44.4293932Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3 2025-05-07T19:44:44.4294412Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1 2025-05-07T19:44:44.4294854Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:44.4295348Z llvm-openmp conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0 2025-05-07T19:44:44.4295872Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0 2025-05-07T19:44:44.4296149Z 2025-05-07T19:44:44.4296268Z The following packages will be UPDATED: 2025-05-07T19:44:44.4296479Z 2025-05-07T19:44:44.4296745Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:44.4297092Z 2025-05-07T19:44:44.4297095Z 2025-05-07T19:44:44.4297106Z 2025-05-07T19:44:44.4297253Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:44.9703752Z icu-73.2 | 11.5 MB | ########## | 100% 2025-05-07T19:44:45.0435994Z libcxx-19.1.7 | 1000 KB | ########## | 100% 2025-05-07T19:44:45.1709388Z clang-16-16.0.6 | 780 KB | ########## | 100% 2025-05-07T19:44:45.2640686Z libiconv-1.18 | 696 KB | ########## | 100% 2025-05-07T19:44:45.3084457Z libxml2-2.12.7 | 688 KB | ########## | 100% 2025-05-07T19:44:45.3711318Z zstd-1.5.6 | 542 KB | ########## | 100% 2025-05-07T19:44:45.4658193Z libcxxabi-19.1.7 | 158 KB | ########## | 100%
2025-05-07T19:44:45.4724330Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:45.4725224Z 2025-05-07T19:44:45.4732589Z 2025-05-07T19:44:45.4961755Z libllvm16-16.0.6 | 33.7 MB | #######7 | 78%  2025-05-07T19:44:45.5047531Z llvm-openmp-16.0.6 | 39.9 MB | #########2 | 93% 2025-05-07T19:44:45.5047916Z 2025-05-07T19:44:45.5047972Z 2025-05-07T19:44:45.5048062Z 2025-05-07T19:44:45.5088426Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:45.5088884Z 2025-05-07T19:44:45.5123727Z compiler-rt_linux-64 | 36.0 MB | #########4 | 94%  2025-05-07T19:44:45.5124219Z 2025-05-07T19:44:45.5124332Z 2025-05-07T19:44:45.5124336Z 2025-05-07T19:44:45.5124340Z 2025-05-07T19:44:45.5124343Z 2025-05-07T19:44:45.5124347Z 2025-05-07T19:44:45.5124366Z 2025-05-07T19:44:45.5124369Z 2025-05-07T19:44:45.5124373Z 2025-05-07T19:44:45.5124376Z 2025-05-07T19:44:45.5124379Z 2025-05-07T19:44:45.5124386Z 2025-05-07T19:44:45.5210705Z clangxx-16.0.6 | 110 KB | #4 | 15%  2025-05-07T19:44:45.5211040Z 2025-05-07T19:44:45.5211044Z 2025-05-07T19:44:45.5211048Z 2025-05-07T19:44:45.5211052Z 2025-05-07T19:44:45.5211055Z 2025-05-07T19:44:45.5211059Z 2025-05-07T19:44:45.5211062Z 2025-05-07T19:44:45.5211066Z 2025-05-07T19:44:45.5211069Z 2025-05-07T19:44:45.5211072Z 2025-05-07T19:44:45.5211105Z 2025-05-07T19:44:45.5211109Z 2025-05-07T19:44:45.5505050Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:45.5505585Z 2025-05-07T19:44:45.5505590Z 2025-05-07T19:44:45.5505594Z 2025-05-07T19:44:45.5505597Z 2025-05-07T19:44:45.5505601Z 2025-05-07T19:44:45.5505620Z 2025-05-07T19:44:45.5505623Z 2025-05-07T19:44:45.5505627Z 2025-05-07T19:44:45.5509725Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:45.5510016Z 2025-05-07T19:44:45.5510020Z 2025-05-07T19:44:45.5510024Z 2025-05-07T19:44:45.5510027Z 2025-05-07T19:44:45.5510031Z 2025-05-07T19:44:45.5510047Z 2025-05-07T19:44:45.5510051Z 2025-05-07T19:44:45.5510054Z 2025-05-07T19:44:45.5657620Z libxml2-2.12.7 | 688 KB | ########## | 
100%  2025-05-07T19:44:45.5658246Z 2025-05-07T19:44:45.5658259Z 2025-05-07T19:44:45.5658290Z 2025-05-07T19:44:45.5658296Z 2025-05-07T19:44:45.5658301Z 2025-05-07T19:44:45.5658504Z 2025-05-07T19:44:45.5658520Z 2025-05-07T19:44:45.5658555Z 2025-05-07T19:44:45.5658559Z 2025-05-07T19:44:45.5658563Z 2025-05-07T19:44:45.5658568Z 2025-05-07T19:44:45.5658584Z 2025-05-07T19:44:45.5658588Z 2025-05-07T19:44:45.5703924Z compiler-rt-16.0.6 | 107 KB | #4 | 15%  2025-05-07T19:44:45.5704270Z 2025-05-07T19:44:45.5704275Z 2025-05-07T19:44:45.5704278Z 2025-05-07T19:44:45.5704282Z 2025-05-07T19:44:45.5704286Z 2025-05-07T19:44:45.5704289Z 2025-05-07T19:44:45.5704293Z 2025-05-07T19:44:45.5704296Z 2025-05-07T19:44:45.5704300Z 2025-05-07T19:44:45.5704303Z 2025-05-07T19:44:45.5704312Z 2025-05-07T19:44:45.5704315Z 2025-05-07T19:44:45.5704334Z 2025-05-07T19:44:45.5724541Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:45.5724890Z 2025-05-07T19:44:45.5724970Z 2025-05-07T19:44:45.5725441Z libllvm16-16.0.6 | 33.7 MB | #########1 | 92%  2025-05-07T19:44:45.5725773Z 2025-05-07T19:44:45.5725778Z 2025-05-07T19:44:45.5725796Z 2025-05-07T19:44:45.5725800Z 2025-05-07T19:44:45.5725805Z 2025-05-07T19:44:45.5725809Z 2025-05-07T19:44:45.5725834Z 2025-05-07T19:44:45.5725838Z 2025-05-07T19:44:45.5725842Z 2025-05-07T19:44:45.5725846Z 2025-05-07T19:44:45.5725851Z 2025-05-07T19:44:45.5725856Z 2025-05-07T19:44:45.5725860Z 2025-05-07T19:44:45.5725864Z 2025-05-07T19:44:45.5762463Z zlib-1.2.13 | 91 KB | #7 | 18%  2025-05-07T19:44:45.5762860Z 2025-05-07T19:44:45.5762996Z 2025-05-07T19:44:45.5763004Z 2025-05-07T19:44:45.5763009Z 2025-05-07T19:44:45.5763015Z 2025-05-07T19:44:45.5763019Z 2025-05-07T19:44:45.5763052Z 2025-05-07T19:44:45.5763056Z 2025-05-07T19:44:45.5763060Z 2025-05-07T19:44:45.5763064Z 2025-05-07T19:44:45.5763068Z 2025-05-07T19:44:45.5763072Z 2025-05-07T19:44:45.5763076Z 2025-05-07T19:44:45.5763080Z 2025-05-07T19:44:45.6022409Z zlib-1.2.13 | 91 KB | ########## | 100%  
2025-05-07T19:44:45.6022743Z 2025-05-07T19:44:45.6022747Z 2025-05-07T19:44:45.6022751Z 2025-05-07T19:44:45.6022754Z 2025-05-07T19:44:45.6022942Z 2025-05-07T19:44:45.6022946Z 2025-05-07T19:44:45.6022949Z 2025-05-07T19:44:45.6022953Z 2025-05-07T19:44:45.6022956Z 2025-05-07T19:44:45.6023227Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:45.6023501Z 2025-05-07T19:44:45.6023504Z 2025-05-07T19:44:45.6023508Z 2025-05-07T19:44:45.6023511Z 2025-05-07T19:44:45.6023515Z 2025-05-07T19:44:45.6023518Z 2025-05-07T19:44:45.6023521Z 2025-05-07T19:44:45.6023525Z 2025-05-07T19:44:45.6023528Z 2025-05-07T19:44:45.6283293Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:45.6283685Z 2025-05-07T19:44:45.6283749Z 2025-05-07T19:44:45.6283753Z 2025-05-07T19:44:45.6283756Z 2025-05-07T19:44:45.6283776Z 2025-05-07T19:44:45.6283779Z 2025-05-07T19:44:45.6283783Z 2025-05-07T19:44:45.6283798Z 2025-05-07T19:44:45.6283821Z 2025-05-07T19:44:45.6283824Z 2025-05-07T19:44:45.6283843Z 2025-05-07T19:44:45.6283951Z 2025-05-07T19:44:45.6283963Z 2025-05-07T19:44:45.6283973Z 2025-05-07T19:44:45.6283998Z 2025-05-07T19:44:45.6304029Z libzlib-1.2.13 | 60 KB | ##6 | 27%  2025-05-07T19:44:45.6304361Z 2025-05-07T19:44:45.6304366Z 2025-05-07T19:44:45.6304369Z 2025-05-07T19:44:45.6304373Z 2025-05-07T19:44:45.6304376Z 2025-05-07T19:44:45.6304380Z 2025-05-07T19:44:45.6304383Z 2025-05-07T19:44:45.6304387Z 2025-05-07T19:44:45.6304397Z 2025-05-07T19:44:45.6304414Z 2025-05-07T19:44:45.6304418Z 2025-05-07T19:44:45.6304422Z 2025-05-07T19:44:45.6304425Z 2025-05-07T19:44:45.6304428Z 2025-05-07T19:44:45.6304432Z 2025-05-07T19:44:45.6361379Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:45.6361715Z 2025-05-07T19:44:45.6361763Z 2025-05-07T19:44:45.6362041Z 2025-05-07T19:44:45.6362046Z 2025-05-07T19:44:45.6362049Z 2025-05-07T19:44:45.6362067Z 2025-05-07T19:44:45.6362071Z 2025-05-07T19:44:45.6362074Z 2025-05-07T19:44:45.6362078Z 2025-05-07T19:44:45.6362087Z 2025-05-07T19:44:45.6362380Z 
libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:45.6362694Z 2025-05-07T19:44:45.6362698Z 2025-05-07T19:44:45.6362702Z 2025-05-07T19:44:45.6362705Z 2025-05-07T19:44:45.6362708Z 2025-05-07T19:44:45.6362712Z 2025-05-07T19:44:45.6362715Z 2025-05-07T19:44:45.6362719Z 2025-05-07T19:44:45.6362722Z 2025-05-07T19:44:45.6362726Z 2025-05-07T19:44:45.6775831Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:45.6776533Z 2025-05-07T19:44:45.6776579Z 2025-05-07T19:44:45.6776584Z 2025-05-07T19:44:45.6776588Z 2025-05-07T19:44:45.6776591Z 2025-05-07T19:44:45.6776595Z 2025-05-07T19:44:45.6777104Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:45.6777441Z 2025-05-07T19:44:45.6777458Z 2025-05-07T19:44:45.6777462Z 2025-05-07T19:44:45.6777465Z 2025-05-07T19:44:45.6777469Z 2025-05-07T19:44:45.6777472Z 2025-05-07T19:44:45.7414768Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:45.7415076Z 2025-05-07T19:44:45.7415081Z 2025-05-07T19:44:45.7415085Z 2025-05-07T19:44:45.7415101Z 2025-05-07T19:44:45.7415104Z 2025-05-07T19:44:45.7415108Z 2025-05-07T19:44:45.7415111Z 2025-05-07T19:44:45.7415115Z 2025-05-07T19:44:45.7415122Z 2025-05-07T19:44:45.7415125Z 2025-05-07T19:44:45.7415129Z 2025-05-07T19:44:45.7419684Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:45.7420004Z 2025-05-07T19:44:45.7420009Z 2025-05-07T19:44:45.7420012Z 2025-05-07T19:44:45.7420016Z 2025-05-07T19:44:45.7420020Z 2025-05-07T19:44:45.7420024Z 2025-05-07T19:44:45.7420028Z 2025-05-07T19:44:45.7420032Z 2025-05-07T19:44:45.7420036Z 2025-05-07T19:44:45.7420052Z 2025-05-07T19:44:45.7420061Z 2025-05-07T19:44:45.8296205Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:45.8296593Z 2025-05-07T19:44:45.8296785Z 2025-05-07T19:44:45.8296790Z 2025-05-07T19:44:45.8296793Z 2025-05-07T19:44:45.8296797Z 2025-05-07T19:44:45.8296801Z 2025-05-07T19:44:45.8296805Z 2025-05-07T19:44:45.8296809Z 2025-05-07T19:44:45.8296813Z 2025-05-07T19:44:45.8296816Z 
2025-05-07T19:44:45.8296820Z 2025-05-07T19:44:45.8296824Z 2025-05-07T19:44:45.8297193Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:45.8297536Z 2025-05-07T19:44:45.8297540Z 2025-05-07T19:44:45.8297543Z 2025-05-07T19:44:45.8297547Z 2025-05-07T19:44:45.8297550Z 2025-05-07T19:44:45.8297554Z 2025-05-07T19:44:45.8297557Z 2025-05-07T19:44:45.8297561Z 2025-05-07T19:44:45.8297564Z 2025-05-07T19:44:45.8297567Z 2025-05-07T19:44:45.8297571Z 2025-05-07T19:44:45.8297577Z 2025-05-07T19:44:45.8455768Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:45.8456106Z 2025-05-07T19:44:45.8456255Z 2025-05-07T19:44:45.8456264Z 2025-05-07T19:44:45.8456286Z 2025-05-07T19:44:45.8678826Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:45.8679105Z 2025-05-07T19:44:45.8679262Z 2025-05-07T19:44:45.8679270Z 2025-05-07T19:44:45.8679275Z 2025-05-07T19:44:45.8679280Z 2025-05-07T19:44:45.8679285Z 2025-05-07T19:44:45.8679289Z 2025-05-07T19:44:45.8679294Z 2025-05-07T19:44:45.8679297Z 2025-05-07T19:44:45.8679302Z 2025-05-07T19:44:45.8679306Z 2025-05-07T19:44:45.8679311Z 2025-05-07T19:44:45.8679315Z 2025-05-07T19:44:45.8679320Z 2025-05-07T19:44:45.8679871Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:45.8680220Z 2025-05-07T19:44:45.8680225Z 2025-05-07T19:44:45.8680236Z 2025-05-07T19:44:45.8680241Z 2025-05-07T19:44:45.8680245Z 2025-05-07T19:44:45.8680529Z 2025-05-07T19:44:45.8680534Z 2025-05-07T19:44:45.8680538Z 2025-05-07T19:44:45.8680541Z 2025-05-07T19:44:45.8680560Z 2025-05-07T19:44:45.8680564Z 2025-05-07T19:44:45.8680567Z 2025-05-07T19:44:45.8680578Z 2025-05-07T19:44:45.8680581Z 2025-05-07T19:44:45.8685206Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:45.8685494Z 2025-05-07T19:44:45.8685498Z 2025-05-07T19:44:45.8685501Z 2025-05-07T19:44:45.8685517Z 2025-05-07T19:44:45.8685520Z 2025-05-07T19:44:45.8685524Z 2025-05-07T19:44:45.8685527Z 2025-05-07T19:44:45.8685531Z 2025-05-07T19:44:45.8685534Z 2025-05-07T19:44:45.8685542Z 
2025-05-07T19:44:45.8685545Z 2025-05-07T19:44:45.8685549Z 2025-05-07T19:44:45.8686077Z 2025-05-07T19:44:45.8688933Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:45.8689257Z 2025-05-07T19:44:45.8689268Z 2025-05-07T19:44:45.8689271Z 2025-05-07T19:44:45.8689275Z 2025-05-07T19:44:45.8689284Z 2025-05-07T19:44:45.8689287Z 2025-05-07T19:44:45.8689291Z 2025-05-07T19:44:45.8689294Z 2025-05-07T19:44:45.8689298Z 2025-05-07T19:44:45.8689301Z 2025-05-07T19:44:45.8689305Z 2025-05-07T19:44:45.8689312Z 2025-05-07T19:44:45.8689315Z 2025-05-07T19:44:45.8803648Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:45.8804003Z 2025-05-07T19:44:45.8804008Z 2025-05-07T19:44:45.8804011Z 2025-05-07T19:44:45.8804015Z 2025-05-07T19:44:45.8804018Z 2025-05-07T19:44:45.8804022Z 2025-05-07T19:44:45.8804025Z 2025-05-07T19:44:45.8804029Z 2025-05-07T19:44:45.8804032Z 2025-05-07T19:44:45.8804036Z 2025-05-07T19:44:45.8804039Z 2025-05-07T19:44:45.8804043Z 2025-05-07T19:44:45.8804046Z 2025-05-07T19:44:45.8804061Z 2025-05-07T19:44:45.8804065Z 2025-05-07T19:44:45.8804359Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:45.8804664Z 2025-05-07T19:44:45.8804668Z 2025-05-07T19:44:45.8804684Z 2025-05-07T19:44:45.8804687Z 2025-05-07T19:44:45.8804691Z 2025-05-07T19:44:45.8804695Z 2025-05-07T19:44:45.8804698Z 2025-05-07T19:44:45.8804701Z 2025-05-07T19:44:45.8804716Z 2025-05-07T19:44:45.8804890Z 2025-05-07T19:44:45.8804894Z 2025-05-07T19:44:45.8804897Z 2025-05-07T19:44:45.8804900Z 2025-05-07T19:44:45.8804904Z 2025-05-07T19:44:45.8804907Z 2025-05-07T19:44:45.9046530Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:45.9046857Z 2025-05-07T19:44:45.9172946Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:45.9173232Z 2025-05-07T19:44:45.9173327Z 2025-05-07T19:44:45.9313508Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:46.0812555Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:46.0813358Z 
2025-05-07T19:44:46.3141809Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100% 2025-05-07T19:44:46.3704899Z compiler-rt_linux-64 | 36.0 MB | ########## | 100% 2025-05-07T19:44:46.5340828Z libllvm16-16.0.6 | 33.7 MB | ########## | 100% 2025-05-07T19:44:46.5352129Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100%
2025-05-07T19:44:46.5364870Z done 2025-05-07T19:44:46.6365336Z Preparing transaction: done 2025-05-07T19:44:46.7391964Z Verifying transaction: done 2025-05-07T19:44:46.8391228Z Executing transaction: done 2025-05-07T19:44:46.9268439Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:50.6640880Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:50.6641558Z 2025-05-07T19:44:50.6653893Z 2025-05-07T19:44:50.6672420Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:50.6686127Z 2025-05-07T19:44:50.6686144Z 2025-05-07T19:44:50.6702079Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:50.6702650Z 2025-05-07T19:44:50.6716133Z 2025-05-07T19:44:50.6734265Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:50.6734823Z 2025-05-07T19:44:50.6743851Z 2025-05-07T19:44:50.6744898Z [INSTALL] Removing GCC package activation scripts ... 2025-05-07T19:44:52.5248440Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:44:52.5249013Z 2025-05-07T19:44:52.5262876Z total 28 2025-05-07T19:44:52.5263174Z drwxr-xr-x. 2 root root 134 May 7 19:44 . 2025-05-07T19:44:52.5263559Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:44:52.5264017Z -rw-r--r--. 
2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:44:52.5264537Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:44:52.5264996Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:44:52.5265822Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:44:52.5266106Z 2025-05-07T19:44:52.5266451Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gcc_linux-64.sh 2025-05-07T19:44:52.5266882Z 2025-05-07T19:44:52.5277628Z 2025-05-07T19:44:52.5278367Z + rm -rf /github/home/miniconda/envs/build_binary/etc/conda/activate.d/activate-gxx_linux-64.sh 2025-05-07T19:44:52.5278833Z 2025-05-07T19:44:52.5293238Z 2025-05-07T19:44:52.5293944Z + conda env config vars set -n build_binary CC= 2025-05-07T19:44:52.5294678Z 2025-05-07T19:44:52.9567829Z 2025-05-07T19:44:52.9568508Z + conda env config vars set -n build_binary CXX= 2025-05-07T19:44:52.9568793Z 2025-05-07T19:44:53.3780295Z 2025-05-07T19:44:53.3780700Z + conda run -n build_binary printenv CC 2025-05-07T19:44:53.3780976Z 2025-05-07T19:44:54.9537433Z 2025-05-07T19:44:54.9537680Z 2025-05-07T19:44:55.0125581Z 2025-05-07T19:44:55.0126247Z + conda run -n build_binary printenv CXX 2025-05-07T19:44:55.0126980Z 2025-05-07T19:44:56.5965839Z 2025-05-07T19:44:56.5966116Z 2025-05-07T19:44:56.6718917Z 2025-05-07T19:44:58.3224997Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ... 2025-05-07T19:44:59.8929759Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. 
(See above for error) 2025-05-07T19:44:59.9503966Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib 2025-05-07T19:44:59.9505305Z 2025-05-07T19:45:00.3579012Z 2025-05-07T19:45:01.9271803Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:45:01.9841954Z 2025-05-07T19:45:01.9842445Z [CHECK] Binary cc found in PATH 2025-05-07T19:45:03.5544374Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:45:03.5544672Z 2025-05-07T19:45:03.6319915Z [CHECK] Binary gcc found in PATH 2025-05-07T19:45:05.2098059Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:45:05.2098908Z 2025-05-07T19:45:05.2663681Z [CHECK] Binary c++ found in PATH 2025-05-07T19:45:06.8417832Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:45:06.8418646Z 2025-05-07T19:45:06.8995354Z [CHECK] Binary g++ found in PATH 2025-05-07T19:45:06.8996576Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:45:06.8997820Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:06.8998437Z 2025-05-07T19:45:08.5559799Z #define _LP64 1 2025-05-07T19:45:08.5560180Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:08.5560465Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:08.5560742Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:08.5561006Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:08.5561285Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:08.5561572Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:08.5561858Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:08.5562143Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:08.5562448Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:08.5562745Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:08.5563097Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:08.5563428Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:08.5563716Z #define __CHAR_BIT__ 8 2025-05-07T19:45:08.5563999Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 
2025-05-07T19:45:08.5564323Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.5564666Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.5564982Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.5565309Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.5565612Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.5565936Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.5566274Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.5566601Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.5566932Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.5567242Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:08.5567925Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:08.5568231Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:08.5568569Z #define __DBL_DIG__ 15 2025-05-07T19:45:08.5568929Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:08.5569266Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:08.5569552Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.5569821Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.5570101Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:08.5570379Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:08.5570661Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:08.5570945Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:08.5571256Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:08.5571549Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:08.5571828Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:08.5572162Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:08.5572459Z #define __ELF__ 1 2025-05-07T19:45:08.5572709Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:08.5572969Z #define __FLOAT128__ 1 2025-05-07T19:45:08.5573222Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:08.5573546Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:08.5573870Z 
#define __FLT16_DIG__ 3 2025-05-07T19:45:08.5574137Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:08.5574443Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:08.5574728Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:08.5575003Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.5575298Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:08.5575560Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:08.5576280Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:08.5576542Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:08.5576996Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:08.5577361Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:08.5577657Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:08.5577964Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:08.5578248Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:08.5578555Z #define __FLT_DIG__ 6 2025-05-07T19:45:08.5578795Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:08.5579095Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:08.5579357Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:08.5579637Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.5579900Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:08.5580261Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:08.5580543Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:08.5580800Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:08.5581096Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:08.5581366Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:08.5581651Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:08.5581927Z #define __FLT_RADIX__ 2 2025-05-07T19:45:08.5582172Z #define __FXSR__ 1 2025-05-07T19:45:08.5582403Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:08.5582708Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:08.5583019Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.5583352Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.5583678Z #define 
__GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.5583974Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.5584284Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.5584581Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.5584896Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.5585206Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.5585530Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:08.5585845Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.5586163Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:08.5586473Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:08.5586822Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:08.5587167Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:08.5587614Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:08.5587935Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:08.5588192Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:08.5588479Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:08.5588731Z #define __GNUC__ 4 2025-05-07T19:45:08.5588979Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:08.5589246Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:08.5589524Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:08.5589776Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:08.5590041Z #define __INT16_MAX__ 32767 2025-05-07T19:45:08.5590311Z #define __INT16_TYPE__ short 2025-05-07T19:45:08.5590568Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:08.5590835Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:08.5591080Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:08.5591346Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:08.5591609Z #define __INT32_TYPE__ int 2025-05-07T19:45:08.5591871Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:08.5592233Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:08.5592491Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:08.5592744Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.5593043Z 
#define __INT64_TYPE__ long int 2025-05-07T19:45:08.5593311Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:08.5593552Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:08.5593811Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:08.5594056Z #define __INT8_MAX__ 127 2025-05-07T19:45:08.5594324Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:08.5594598Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:08.5594878Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:08.5595134Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:08.5595414Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:08.5595710Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:08.5596091Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:08.5596361Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:08.5596617Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:08.5596898Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:08.5597203Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:08.5597494Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:08.5597748Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:08.5598027Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:08.5598290Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:08.5598572Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:08.5598845Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:08.5599119Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:08.5599389Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:08.5599649Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:08.5599940Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:08.5600197Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:08.5600469Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:08.5600735Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:08.5601040Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.5601352Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:08.5601649Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:08.5601915Z #define __INT_FAST8_FMTd__ 
"hhd" 2025-05-07T19:45:08.5602194Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:08.5602471Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:08.5602739Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:08.5603041Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:08.5603300Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:08.5603587Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:08.5603855Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:08.5604141Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:08.5604412Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:08.5604689Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:08.5604952Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:08.5605235Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:08.5605529Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:08.5605790Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:08.5606068Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:08.5606432Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:08.5606741Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.5607057Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:08.5607353Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:08.5607618Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:08.5607900Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:08.5608187Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:08.5608458Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:08.5608765Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:08.5609026Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:08.5609298Z #define __INT_WIDTH__ 32 2025-05-07T19:45:08.5609543Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:08.5609871Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:08.5610204Z #define __LDBL_DIG__ 18 2025-05-07T19:45:08.5610484Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:08.5610808Z #define __LDBL_HAS_DENORM__ 1 
2025-05-07T19:45:08.5611083Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.5611359Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.5611623Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:08.5611897Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:08.5612169Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:08.5612468Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:08.5612790Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:08.5613086Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:08.5613378Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:08.5613707Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:08.5613961Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:08.5614244Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:08.5614637Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:08.5614929Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:08.5615184Z #define __LP64__ 1 2025-05-07T19:45:08.5615399Z #define __MMX__ 1 2025-05-07T19:45:08.5615632Z #define __NO_INLINE__ 1 2025-05-07T19:45:08.5615868Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:08.5616134Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:08.5616422Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:08.5616763Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:08.5617082Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:08.5617417Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:08.5617767Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:08.5618089Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:08.5618410Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:08.5618713Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:08.5619012Z #define __PIC__ 2 2025-05-07T19:45:08.5619268Z #define __PIE__ 2 2025-05-07T19:45:08.5619506Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:08.5619798Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:08.5620081Z #define __PTRDIFF_FMTd__ "ld" 
2025-05-07T19:45:08.5620464Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:08.5620934Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:08.5621262Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:08.5621574Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:08.5621854Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:08.5622107Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:08.5622365Z #define __SEG_FS 1 2025-05-07T19:45:08.5622591Z #define __SEG_GS 1 2025-05-07T19:45:08.5622854Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:08.5623141Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:08.5623412Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:08.5623740Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:08.5624020Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:08.5624315Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:08.5624580Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:08.5624864Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:08.5625124Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:08.5625389Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:08.5625749Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:08.5626027Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:08.5626290Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:08.5626558Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:08.5626834Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:08.5627083Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:08.5627354Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:08.5627612Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:08.5627877Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:08.5628140Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:08.5628419Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:08.5628689Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:08.5628992Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.5629348Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:08.5629663Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:08.5629951Z #define __SSE2_MATH__ 1 
2025-05-07T19:45:08.5630203Z #define __SSE2__ 1 2025-05-07T19:45:08.5630467Z #define __SSE_MATH__ 1 2025-05-07T19:45:08.5630715Z #define __SSE__ 1 2025-05-07T19:45:08.5630976Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:08.5631240Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:08.5631523Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:08.5631791Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:08.5632093Z #define __STDC__ 1 2025-05-07T19:45:08.5632336Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:08.5632637Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:08.5633040Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:08.5633300Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:08.5633577Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:08.5633828Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:08.5634121Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:08.5634413Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:08.5634784Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:08.5635040Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:08.5635315Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:08.5635570Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:08.5635851Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:08.5636154Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:08.5636440Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:08.5636735Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:08.5636993Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:08.5637281Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:08.5637536Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:08.5637832Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.5638147Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:08.5638471Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:08.5638726Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:08.5639009Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:08.5639294Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:08.5639551Z #define __UINT8_FMTx__ "hhx" 
2025-05-07T19:45:08.5639836Z #define __UINT8_MAX__ 255 2025-05-07T19:45:08.5640095Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:08.5640416Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:08.5640686Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:08.5640980Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:08.5641244Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:08.5641528Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:08.5641807Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.5642156Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:08.5642491Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:08.5642753Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:08.5643051Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:08.5643320Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:08.5643615Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:08.5643898Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.5644258Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:08.5644599Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:08.5644890Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:08.5645172Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:08.5645545Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:08.5645854Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:08.5646130Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:08.5646451Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:08.5646760Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:08.5647059Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:08.5647327Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:08.5647621Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:08.5647896Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:08.5648227Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:08.5648530Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:08.5648792Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:08.5649077Z 
#define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:08.5649337Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:08.5649639Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.5649972Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.5650292Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:08.5650570Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:08.5650832Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:08.5651113Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:08.5651373Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:08.5651659Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:08.5651949Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:08.5652238Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:08.5652505Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:08.5652788Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:08.5653050Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:08.5653399Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:08.5653712Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:08.5653973Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:08.5654244Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:08.5654504Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:08.5654782Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:08.5655071Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:08.5655367Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:08.5655627Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:08.5655898Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:08.5656170Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:08.5656452Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.5656794Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.5657094Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:08.5657369Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:08.5657628Z 
#define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:08.5657900Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:08.5658159Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:08.5658436Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:08.5658725Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:08.5659322Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.5659925Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:08.5660258Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:08.5660687Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:08.5660940Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:08.5661296Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:08.5661579Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:08.5661851Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:08.5662088Z #define __amd64 1 2025-05-07T19:45:08.5662317Z #define __amd64__ 1 2025-05-07T19:45:08.5662547Z #define __clang__ 1 2025-05-07T19:45:08.5662798Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:08.5663119Z #define __clang_major__ 16 2025-05-07T19:45:08.5663369Z #define __clang_minor__ 0 2025-05-07T19:45:08.5663637Z #define __clang_patchlevel__ 6 2025-05-07T19:45:08.5664322Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.5665002Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:08.5665337Z #define __code_model_small__ 1 2025-05-07T19:45:08.5665616Z #define __gnu_linux__ 1 2025-05-07T19:45:08.5665852Z #define __k8 1 2025-05-07T19:45:08.5666079Z #define __k8__ 1 2025-05-07T19:45:08.5666301Z #define __linux 1 2025-05-07T19:45:08.5666518Z #define __linux__ 1 2025-05-07T19:45:08.5666747Z #define __llvm__ 1 2025-05-07T19:45:08.5666961Z #define __pic__ 2 2025-05-07T19:45:08.5667191Z #define __pie__ 2 2025-05-07T19:45:08.5667456Z #define __seg_fs 
__attribute__((address_space(257))) 2025-05-07T19:45:08.5667850Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:08.5668185Z #define __tune_k8__ 1 2025-05-07T19:45:08.5668428Z #define __unix 1 2025-05-07T19:45:08.5668640Z #define __unix__ 1 2025-05-07T19:45:08.5668867Z #define __x86_64 1 2025-05-07T19:45:08.5669101Z #define __x86_64__ 1 2025-05-07T19:45:08.5669321Z #define linux 1 2025-05-07T19:45:08.5669542Z #define unix 1 2025-05-07T19:45:08.5669669Z 2025-05-07T19:45:08.6318460Z 2025-05-07T19:45:08.6319503Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:45:08.6320918Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:08.6321620Z 2025-05-07T19:45:10.2606863Z #define _GNU_SOURCE 1 2025-05-07T19:45:10.2607631Z #define _LP64 1 2025-05-07T19:45:10.2608310Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:10.2609051Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:10.2609976Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:10.2610250Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:10.2610503Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:10.2612978Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:10.2613312Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:10.2613733Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:10.2614017Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:10.2614302Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:10.2614639Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:10.2614925Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:10.2615203Z #define __CHAR_BIT__ 8 2025-05-07T19:45:10.2615442Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:10.2615756Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:10.2616063Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:10.2616374Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:10.2616663Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:10.2616971Z #define 
__CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:10.2617282Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:10.2617577Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:10.2617899Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:10.2618198Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:10.2618508Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:10.2618782Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:10.2619085Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:10.2619388Z #define __DBL_DIG__ 15 2025-05-07T19:45:10.2619653Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:10.2619965Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:10.2620345Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:10.2620800Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.2621075Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:10.2621392Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:10.2621664Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:10.2621963Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:10.2622280Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:10.2622586Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:10.2622873Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:10.2623216Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:10.2623548Z #define __DEPRECATED 1 2025-05-07T19:45:10.2623965Z #define __ELF__ 1 2025-05-07T19:45:10.2624207Z #define __EXCEPTIONS 1 2025-05-07T19:45:10.2624453Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:10.2624738Z #define __FLOAT128__ 1 2025-05-07T19:45:10.2624987Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:10.2625319Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:10.2625648Z #define __FLT16_DIG__ 3 2025-05-07T19:45:10.2625926Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:10.2626229Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:10.2626519Z #define __FLT16_HAS_INFINITY__ 1 
2025-05-07T19:45:10.2626935Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.2627199Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:10.2627465Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:10.2627713Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:10.2627985Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:10.2628253Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:10.2628542Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:10.2628804Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:10.2629100Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:10.2629362Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:10.2629659Z #define __FLT_DIG__ 6 2025-05-07T19:45:10.2629907Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:10.2630180Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:10.2630443Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:10.2630697Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.2630965Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:10.2631207Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:10.2631471Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:10.2631715Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:10.2631997Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:10.2632325Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:10.2632600Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:10.2632880Z #define __FLT_RADIX__ 2 2025-05-07T19:45:10.2633103Z #define __FXSR__ 1 2025-05-07T19:45:10.2633346Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:10.2633626Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:10.2633940Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:10.2634246Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:10.2634563Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:10.2634847Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:10.2635142Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:10.2635428Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 
2025-05-07T19:45:10.2635736Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:10.2636046Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:10.2636344Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:10.2636666Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:10.2636961Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:10.2637270Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:10.2637586Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:10.2637922Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:10.2638232Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:10.2638550Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:10.2638849Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:10.2639127Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:10.2639393Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:10.2639633Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:10.2639895Z #define __GNUC__ 4 2025-05-07T19:45:10.2640102Z #define __GNUG__ 4 2025-05-07T19:45:10.2640337Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:10.2640599Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:10.2640883Z #define __GXX_RTTI 1 2025-05-07T19:45:10.2641099Z #define __GXX_WEAK__ 1 2025-05-07T19:45:10.2641341Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:10.2641601Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:10.2641839Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:10.2642093Z #define __INT16_MAX__ 32767 2025-05-07T19:45:10.2642442Z #define __INT16_TYPE__ short 2025-05-07T19:45:10.2642705Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:10.2642942Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:10.2643208Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:10.2643471Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:10.2643731Z #define __INT32_TYPE__ int 2025-05-07T19:45:10.2643990Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:10.2644237Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:10.2644501Z #define 
__INT64_FMTi__ "li" 2025-05-07T19:45:10.2644756Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.2645065Z #define __INT64_TYPE__ long int 2025-05-07T19:45:10.2645323Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:10.2645590Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:10.2645832Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:10.2646104Z #define __INT8_MAX__ 127 2025-05-07T19:45:10.2646369Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:10.2646643Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:10.2646917Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:10.2647173Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:10.2647454Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:10.2647746Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:10.2648029Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:10.2648280Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:10.2648549Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:10.2648813Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:10.2649121Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:10.2649398Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:10.2649645Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:10.2649922Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:10.2650177Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:10.2650451Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:10.2650782Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:10.2651059Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:10.2651311Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:10.2651589Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:10.2651859Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:10.2652126Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:10.2652392Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:10.2652647Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:10.2652940Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.2653246Z #define __INT_FAST64_TYPE__ long 
int 2025-05-07T19:45:10.2653535Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:10.2653789Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:10.2654068Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:10.2654326Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:10.2654606Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:10.2654882Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:10.2655157Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:10.2655433Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:10.2655693Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:10.2655976Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:10.2656242Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:10.2656512Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:10.2656769Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:10.2657052Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:10.2657331Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:10.2657606Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:10.2657872Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:10.2658156Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:10.2658456Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:10.2658764Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:10.2659059Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:10.2659317Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:10.2659599Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:10.2659863Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:10.2660240Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:10.2660817Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:10.2661108Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:10.2661392Z #define __INT_WIDTH__ 32 2025-05-07T19:45:10.2661661Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:10.2662007Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:10.2662357Z #define __LDBL_DIG__ 18 2025-05-07T19:45:10.2662661Z 
#define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:10.2663000Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:10.2663289Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:10.2663566Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:10.2663861Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:10.2664136Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:10.2664427Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:10.2664739Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:10.2665068Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:10.2665373Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:10.2665682Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:10.2666021Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:10.2666281Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:10.2666578Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:10.2667013Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:10.2667308Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:10.2667539Z #define __LP64__ 1 2025-05-07T19:45:10.2667763Z #define __MMX__ 1 2025-05-07T19:45:10.2667987Z #define __NO_INLINE__ 1 2025-05-07T19:45:10.2668224Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:10.2668499Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:10.2668784Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:10.2669126Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:10.2669495Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:10.2669822Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:10.2670126Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:10.2670448Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:10.2670739Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:10.2671019Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:10.2671286Z #define __PIC__ 2 2025-05-07T19:45:10.2671492Z #define __PIE__ 2 2025-05-07T19:45:10.2671725Z #define __POINTER_WIDTH__ 64 
2025-05-07T19:45:10.2671985Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:10.2672276Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:10.2672534Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:10.2672823Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:10.2673122Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:10.2673411Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:10.2673664Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:10.2673935Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:10.2674193Z #define __SEG_FS 1 2025-05-07T19:45:10.2674413Z #define __SEG_GS 1 2025-05-07T19:45:10.2674651Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:10.2674894Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:10.2675172Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:10.2675453Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:10.2675734Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:10.2676371Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:10.2676777Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:10.2677112Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:10.2677395Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:10.2677673Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:10.2677962Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:10.2678261Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:10.2678517Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:10.2678808Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:10.2679079Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:10.2679354Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:10.2679615Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:10.2679891Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:10.2680149Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:10.2680418Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:10.2680845Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:10.2681098Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:10.2681379Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.2681693Z #define __SIZE_TYPE__ long unsigned int 
2025-05-07T19:45:10.2682011Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:10.2682258Z #define __SSE2_MATH__ 1 2025-05-07T19:45:10.2682512Z #define __SSE2__ 1 2025-05-07T19:45:10.2682734Z #define __SSE_MATH__ 1 2025-05-07T19:45:10.2683094Z #define __SSE__ 1 2025-05-07T19:45:10.2683339Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:10.2683664Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:10.2683921Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:10.2684175Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:10.2684427Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:10.2684655Z #define __STDC__ 1 2025-05-07T19:45:10.2684886Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:10.2685130Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:10.2685390Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:10.2685638Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:10.2685898Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:10.2686140Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:10.2686417Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:10.2686698Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:10.2686966Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:10.2687227Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:10.2687464Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:10.2687723Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:10.2687969Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:10.2688256Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:10.2688530Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:10.2688797Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:10.2689132Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:10.2689402Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:10.2689646Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:10.2689931Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.2690264Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:10.2690553Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:10.2690814Z #define __UINT8_FMTX__ "hhX" 
2025-05-07T19:45:10.2691063Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:10.2691329Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:10.2691574Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:10.2691838Z #define __UINT8_MAX__ 255 2025-05-07T19:45:10.2692087Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:10.2692378Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:10.2692637Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:10.2692908Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:10.2693177Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:10.2693425Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:10.2693712Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.2694022Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:10.2694328Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:10.2694587Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:10.2694851Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:10.2695101Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:10.2695367Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:10.2695633Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.2695961Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:10.2696269Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:10.2696521Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:10.2696814Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:10.2697083Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:10.2697369Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:10.2697630Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:10.2697927Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:10.2698229Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:10.2698510Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:10.2698768Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:10.2699123Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:10.2699406Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:10.2699703Z #define 
__UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:10.2700060Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:10.2700444Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:10.2700946Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:10.2701257Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:10.2701626Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.2702005Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:10.2702383Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:10.2702720Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:10.2703019Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:10.2703361Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:10.2703662Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:10.2704003Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:10.2704346Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:10.2704696Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:10.2705008Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:10.2705352Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:10.2705665Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:10.2706024Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:10.2706398Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:10.2706709Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:10.2707048Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:10.2707353Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:10.2707697Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:10.2708045Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:10.2708416Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:10.2708788Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:10.2709116Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:10.2709438Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:10.2709764Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:10.2710168Z #define 
__UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:10.2710508Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:10.2710836Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:10.2711134Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:10.2711456Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:10.2711754Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:10.2712082Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:10.2712409Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:10.2713184Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:10.2713803Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:10.2714070Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:10.2714332Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:10.2714576Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:10.2714850Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:10.2715118Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:10.2715376Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:10.2715602Z #define __amd64 1 2025-05-07T19:45:10.2715823Z #define __amd64__ 1 2025-05-07T19:45:10.2716045Z #define __clang__ 1 2025-05-07T19:45:10.2716282Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:10.2716589Z #define __clang_major__ 16 2025-05-07T19:45:10.2716830Z #define __clang_minor__ 0 2025-05-07T19:45:10.2717091Z #define __clang_patchlevel__ 6 2025-05-07T19:45:10.2717650Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:10.2718284Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:10.2718599Z #define __code_model_small__ 1 2025-05-07T19:45:10.2718877Z #define __cplusplus 201703L 2025-05-07T19:45:10.2719161Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:10.2719448Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:10.2719751Z #define __cpp_alias_templates 200704L 
2025-05-07T19:45:10.2720128Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:10.2720422Z #define __cpp_attributes 200809L 2025-05-07T19:45:10.2720694Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:10.2721001Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:10.2721293Z #define __cpp_constexpr 201603L 2025-05-07T19:45:10.2721593Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:10.2721894Z #define __cpp_decltype 200707L 2025-05-07T19:45:10.2722172Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:10.2722473Z #define __cpp_deduction_guides 201703L 2025-05-07T19:45:10.2722781Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:10.2723112Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:10.2723416Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:10.2723735Z #define __cpp_exceptions 199711L 2025-05-07T19:45:10.2724009Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:10.2724313Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:10.2724616Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:10.2724934Z #define __cpp_hex_float 201603L 2025-05-07T19:45:10.2725209Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:10.2725497Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:10.2725833Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:10.2726140Z #define __cpp_init_captures 201304L 2025-05-07T19:45:10.2726437Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:10.2726729Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:10.2727026Z #define __cpp_lambdas 200907L 2025-05-07T19:45:10.2727305Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:10.2727639Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:10.2728038Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:10.2728400Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:10.2728734Z #define __cpp_nontype_template_args 201411L 
2025-05-07T19:45:10.2729079Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:10.2729430Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:10.2729689Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:10.2729993Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:10.2730264Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:10.2730579Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:10.2730883Z #define __cpp_rtti 199711L 2025-05-07T19:45:10.2731173Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:10.2731501Z #define __cpp_static_assert 201411L 2025-05-07T19:45:10.2731803Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:10.2732146Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:10.2732461Z #define __cpp_template_auto 201606L 2025-05-07T19:45:10.2732802Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:10.2733138Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:10.2733482Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:10.2733799Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:10.2734145Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:10.2734486Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:10.2734808Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:10.2735123Z #define __gnu_linux__ 1 2025-05-07T19:45:10.2735365Z #define __k8 1 2025-05-07T19:45:10.2735628Z #define __k8__ 1 2025-05-07T19:45:10.2735844Z #define __linux 1 2025-05-07T19:45:10.2736106Z #define __linux__ 1 2025-05-07T19:45:10.2736329Z #define __llvm__ 1 2025-05-07T19:45:10.2736588Z #define __pic__ 2 2025-05-07T19:45:10.2736817Z #define __pie__ 2 2025-05-07T19:45:10.2737095Z #define __private_extern__ extern 2025-05-07T19:45:10.2737426Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:10.2737841Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:10.2738209Z #define __tune_k8__ 1 2025-05-07T19:45:10.2738446Z #define 
__unix 1 2025-05-07T19:45:10.2738703Z #define __unix__ 1 2025-05-07T19:45:10.2739009Z #define __x86_64 1 2025-05-07T19:45:10.2739264Z #define __x86_64__ 1 2025-05-07T19:45:10.2739498Z #define linux 1 2025-05-07T19:45:10.2739750Z #define unix 1 2025-05-07T19:45:10.2739884Z 2025-05-07T19:45:10.3199849Z 2025-05-07T19:45:10.3201049Z + conda run -n build_binary c++ --version 2025-05-07T19:45:10.3201758Z 2025-05-07T19:45:11.9121338Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:11.9123169Z Target: x86_64-conda-linux-gnu 2025-05-07T19:45:11.9123961Z Thread model: posix 2025-05-07T19:45:11.9124863Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:11.9126737Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:11.9128089Z 2025-05-07T19:45:11.9700927Z 2025-05-07T19:45:11.9701474Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:11.9702130Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:11.9702488Z 2025-05-07T19:45:13.6281427Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:13.6281857Z 2025-05-07T19:45:13.6282117Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:13.6282705Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:13.6283055Z 2025-05-07T19:45:15.2852849Z #define __cplusplus 201703L 2025-05-07T19:45:15.2853259Z 2025-05-07T19:45:15.2853409Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:15.2922120Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:15.2922600Z . 
$PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:15.2923476Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:15.2923857Z env: 2025-05-07T19:45:15.2924125Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:15.2924501Z BUILD_ENV: build_binary 2025-05-07T19:45:15.2924769Z BUILD_TARGET: default 2025-05-07T19:45:15.2925066Z BUILD_VARIANT: cuda 2025-05-07T19:45:15.2925322Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:15.2925603Z ##[endgroup] 2025-05-07T19:45:15.7230297Z ################################################################################ 2025-05-07T19:45:15.7230692Z # Install Build Tools 2025-05-07T19:45:15.7230922Z # 2025-05-07T19:45:15.7245033Z # [2025-05-07T19:45:15.724Z] + install_build_tools build_binary 2025-05-07T19:45:15.7245473Z ################################################################################ 2025-05-07T19:45:15.7245848Z 2025-05-07T19:45:15.7262095Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:15.8103344Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:15.8114774Z [INSTALL] Installing build tools ... 
2025-05-07T19:45:15.8139221Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:16.5305354Z Channels: 2025-05-07T19:45:16.5305629Z - conda-forge 2025-05-07T19:45:16.5305884Z Platform: linux-64 2025-05-07T19:45:19.5783851Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:23.3395285Z Solving environment: \ | / - done 2025-05-07T19:45:23.3982934Z 2025-05-07T19:45:23.3983400Z ## Package Plan ## 2025-05-07T19:45:23.3984009Z 2025-05-07T19:45:23.3984599Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:23.3985503Z 2025-05-07T19:45:23.3985840Z added / updated specs: 2025-05-07T19:45:23.3986538Z - auditwheel 2025-05-07T19:45:23.3987148Z - bazel 2025-05-07T19:45:23.3987732Z - cmake[version='>=3.30'] 2025-05-07T19:45:23.3988486Z - hypothesis 2025-05-07T19:45:23.3989089Z - jinja2 2025-05-07T19:45:23.3989688Z - make 2025-05-07T19:45:23.3990208Z - ncurses 2025-05-07T19:45:23.3990773Z - ninja 2025-05-07T19:45:23.3991304Z - openblas 2025-05-07T19:45:23.3992352Z - patchelf 2025-05-07T19:45:23.3992929Z - pyyaml 2025-05-07T19:45:23.3993462Z - rhash 2025-05-07T19:45:23.3994030Z - scikit-build 2025-05-07T19:45:23.3994617Z - wheel 2025-05-07T19:45:23.3994929Z 2025-05-07T19:45:23.3994943Z 2025-05-07T19:45:23.3995263Z The following packages will be downloaded: 2025-05-07T19:45:23.3995493Z 2025-05-07T19:45:23.3995613Z package | build 2025-05-07T19:45:23.3995976Z ---------------------------|----------------- 2025-05-07T19:45:23.3996391Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:23.3996837Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:23.3997310Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:23.3997875Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:23.3998296Z bzip2-1.0.8 | h4bc722e_7 247 
KB conda-forge 2025-05-07T19:45:23.3998710Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:45:23.3999245Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:23.3999647Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:23.4000028Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:23.4000634Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:23.4001112Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:23.4001630Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:23.4002141Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:23.4002665Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:23.4003168Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:23.4003844Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:23.4004335Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:23.4004821Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:23.4005286Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:23.4005724Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:23.4006149Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:23.4006600Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:23.4007046Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:23.4007492Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:23.4007904Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:23.4008343Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:23.4008773Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:23.4009292Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:23.4009753Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:23.4010204Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:23.4010640Z 
libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:23.4011058Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:23.4011488Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:23.4011999Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:23.4012433Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:23.4012826Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:23.4013272Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:23.4013721Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:23.4014190Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:23.4014649Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:23.4015073Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:23.4015492Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:23.4015914Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:23.4016356Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:23.4016798Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:23.4017231Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:23.4017656Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:23.4018089Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:23.4018651Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:23.4019067Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:23.4019517Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:23.4019958Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:23.4020481Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:23.4021114Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:23.4021606Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:23.4022050Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 
2025-05-07T19:45:23.4022510Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:23.4022958Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:23.4023402Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:23.4023819Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:23.4024274Z markupsafe-3.0.2 | py39h9399b63_1 22 KB conda-forge 2025-05-07T19:45:23.4024717Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:23.4025155Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:23.4025620Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:23.4026077Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:23.4026534Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:23.4027086Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:23.4027498Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:23.4027886Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:23.4028324Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:23.4028774Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:23.4029200Z python-3.9.22 |h85ef794_1_cpython 22.5 MB conda-forge 2025-05-07T19:45:23.4029698Z pyyaml-6.0.2 | py39h9399b63_2 178 KB conda-forge 2025-05-07T19:45:23.4030094Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:23.4030499Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:23.4030939Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:23.4031378Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:23.4031847Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:23.4032284Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:23.4032695Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:23.4033085Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:23.4033504Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB 
conda-forge 2025-05-07T19:45:23.4033929Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:23.4034349Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:23.4034786Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:23.4035229Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:23.4035778Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:23.4036233Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:23.4036715Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:23.4037186Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:23.4037634Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:23.4038134Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:23.4038595Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:23.4039056Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:23.4039479Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:23.4039886Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:23.4040349Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:23.4040739Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:23.4041133Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:23.4041509Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:23.4041899Z ------------------------------------------------------------ 2025-05-07T19:45:23.4042252Z Total: 330.1 MB 2025-05-07T19:45:23.4042465Z 2025-05-07T19:45:23.4042596Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:23.4042823Z 2025-05-07T19:45:23.4043052Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:23.4043488Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:23.4043972Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:23.4044441Z bazel 
conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:23.4044848Z bzip2 conda-forge/linux-64::bzip2-1.0.8-h4bc722e_7 2025-05-07T19:45:23.4045285Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:23.4045685Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:23.4046179Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:23.4046592Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:23.4046994Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:23.4047488Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:23.4048063Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:23.4048682Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:23.4049293Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:23.4049851Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:23.4050360Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:23.4050841Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:23.4051338Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:23.4051788Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:23.4052253Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:23.4052722Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:23.4053195Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:23.4053700Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:23.4054249Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:23.4054700Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:23.4055186Z keyutils 
conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:23.4055616Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:23.4056052Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:23.4056466Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:23.4056968Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:23.4057497Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:23.4057935Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:23.4058410Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:23.4058907Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:23.4059389Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:23.4059845Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:23.4060430Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:23.4061186Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:23.4061737Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:23.4062309Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:23.4062852Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:23.4063328Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:23.4063863Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:23.4064379Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.4064914Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.4065473Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:23.4065957Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:23.4066509Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 
2025-05-07T19:45:23.4068076Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:23.4068563Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:23.4069072Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:23.4069544Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:23.4070020Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:23.4070461Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:23.4070919Z libuuid conda-forge/linux-64::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:23.4071342Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:23.4071824Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:23.4072318Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:23.4072733Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:23.4073210Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py39h9399b63_1 2025-05-07T19:45:23.4073801Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:23.4074319Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:23.4074885Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:23.4075436Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:23.4076310Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:23.4076962Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:23.4077494Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:23.4078108Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:23.4078724Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:23.4079251Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py39h9399b63_2 2025-05-07T19:45:23.4079703Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 
2025-05-07T19:45:23.4080165Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:23.4080693Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:23.4081222Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:23.4081809Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:23.4082348Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:23.4082860Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:23.4083400Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:23.4083959Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:23.4084562Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:23.4085105Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:23.4085675Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:23.4086251Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:23.4086790Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:23.4087348Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:23.4087932Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:23.4088505Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:23.4089067Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:23.4089756Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:23.4090295Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:23.4090755Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:23.4091056Z 2025-05-07T19:45:23.4091191Z The following packages will be UPDATED: 2025-05-07T19:45:23.4091418Z 2025-05-07T19:45:23.4091623Z libzlib 
1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:23.4092194Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:23.4092922Z python pkgs/main::python-3.9.21-he870216_1 --> conda-forge::python-3.9.22-h85ef794_1_cpython 2025-05-07T19:45:23.4093639Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:23.4094336Z wheel pkgs/main/linux-64::wheel-0.45.1-py39~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:23.4095020Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:23.4095508Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:23.4096077Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:23.4096459Z 2025-05-07T19:45:23.4096714Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:23.4097130Z 2025-05-07T19:45:23.4097475Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:23.4098072Z 2025-05-07T19:45:23.4098109Z 2025-05-07T19:45:23.4098114Z 2025-05-07T19:45:23.4098267Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:45:23.4098697Z openjdk-23.0.1 | 181.3 MB | | 0%
2025-05-07T19:45:23.4099267Z bazel-7.5.0 | 47.4 MB | | 0%
2025-05-07T19:45:23.4100046Z python-3.9.22 | 22.5 MB | | 0%
2025-05-07T19:45:23.4103145Z cmake-4.0.2 | 19.4 MB | | 0%
2025-05-07T19:45:23.4135981Z libgrpc-1.71.0 | 7.6 MB | | 0%
2025-05-07T19:45:23.4137682Z openblas-0.3.29 | 5.8 MB | | 0%
2025-05-07T19:45:23.4139331Z libopenblas-0.3.29 | 5.6 MB | | 0%
2025-05-07T19:45:23.4141297Z libcups-2.3.3 | 4.3 MB | | 0%
2025-05-07T19:45:23.4142872Z libglib-2.84.0 | 3.8 MB | | 0%
2025-05-07T19:45:23.4143688Z libprotobuf-5.29.3 | 3.2 MB | | 0%
2025-05-07T19:45:23.4144318Z tk-8.6.13 | 3.2 MB | | 0%
2025-05-07T19:45:23.4144949Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%
2025-05-07T19:45:23.4145619Z harfbuzz-9.0.0 | 1.5 MB | | 0%
2025-05-07T19:45:23.4146381Z libgfortran5-15.1.0 | 1.5 MB | | 0%
2025-05-07T19:45:23.4147072Z krb5-1.21.3 | 1.3 MB | | 0%
2025-05-07T19:45:23.4147759Z libabseil-20250127.1 | 1.3 MB | | 0%
2025-05-07T19:45:23.4148528Z cairo-1.18.0 | 961 KB | | 0%
2025-05-07T19:45:23.4149266Z pcre2-10.44 | 934 KB | | 0%
2025-05-07T19:45:23.4149976Z libsqlite-3.49.2 | 895 KB | | 0%
2025-05-07T19:45:23.5507921Z ... (more hidden) ...
2025-05-07T19:45:23.5622290Z libgrpc-1.71.0 | 7.6 MB | | 0%
2025-05-07T19:45:23.6520154Z cmake-4.0.2 | 19.4 MB | | 0%
2025-05-07T19:45:23.6623054Z libgrpc-1.71.0 | 7.6 MB | | 1%
2025-05-07T19:45:23.6975008Z cmake-4.0.2 | 19.4 MB | 1 | 2%
2025-05-07T19:45:23.7270836Z openjdk-23.0.1 | 181.3 MB | | 0%
2025-05-07T19:45:23.7349677Z bazel-7.5.0 | 47.4 MB | | 0%
2025-05-07T19:45:23.7520663Z python-3.9.22 | 22.5 MB | | 0%
2025-05-07T19:45:23.7622144Z libgrpc-1.71.0 | 7.6 MB | ####5 | 45%
2025-05-07T19:45:23.7975505Z 
cmake-4.0.2 | 19.4 MB | #### | 40%  2025-05-07T19:45:23.8270707Z openjdk-23.0.1 | 181.3 MB | 4 | 4% 2025-05-07T19:45:23.8270988Z 2025-05-07T19:45:23.8350242Z bazel-7.5.0 | 47.4 MB | 6 | 7%  2025-05-07T19:45:23.8350691Z 2025-05-07T19:45:23.8350697Z 2025-05-07T19:45:23.8622677Z python-3.9.22 | 22.5 MB | 8 | 8%  2025-05-07T19:45:23.8623035Z 2025-05-07T19:45:23.8623040Z 2025-05-07T19:45:23.8623044Z 2025-05-07T19:45:23.8974373Z cmake-4.0.2 | 19.4 MB | ########4 | 85%  2025-05-07T19:45:23.9066915Z openjdk-23.0.1 | 181.3 MB | 8 | 9% 2025-05-07T19:45:23.9067185Z 2025-05-07T19:45:23.9067190Z 2025-05-07T19:45:23.9067194Z 2025-05-07T19:45:23.9067197Z 2025-05-07T19:45:23.9067484Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:23.9067746Z 2025-05-07T19:45:23.9067749Z 2025-05-07T19:45:23.9067753Z 2025-05-07T19:45:23.9067757Z 2025-05-07T19:45:23.9272424Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:23.9273176Z 2025-05-07T19:45:23.9360155Z bazel-7.5.0 | 47.4 MB | #7 | 18%  2025-05-07T19:45:23.9360435Z 2025-05-07T19:45:23.9360440Z 2025-05-07T19:45:23.9521777Z python-3.9.22 | 22.5 MB | ##6 | 26%  2025-05-07T19:45:23.9522055Z 2025-05-07T19:45:23.9522059Z 2025-05-07T19:45:23.9522064Z 2025-05-07T19:45:23.9522069Z 2025-05-07T19:45:23.9522075Z 2025-05-07T19:45:23.9975806Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:24.0278211Z openjdk-23.0.1 | 181.3 MB | #2 | 13% 2025-05-07T19:45:24.0279171Z 2025-05-07T19:45:24.0360720Z bazel-7.5.0 | 47.4 MB | ###1 | 31%  2025-05-07T19:45:24.0360980Z 2025-05-07T19:45:24.0360985Z 2025-05-07T19:45:24.0524306Z python-3.9.22 | 22.5 MB | ####5 | 45%  2025-05-07T19:45:24.0524568Z 2025-05-07T19:45:24.0524572Z 2025-05-07T19:45:24.0524577Z 2025-05-07T19:45:24.0524599Z 2025-05-07T19:45:24.0524616Z 2025-05-07T19:45:24.1035395Z openblas-0.3.29 | 5.8 MB | #########3 | 94%  2025-05-07T19:45:24.1276463Z openjdk-23.0.1 | 181.3 MB | #6 | 16% 2025-05-07T19:45:24.1276918Z 2025-05-07T19:45:24.1364356Z bazel-7.5.0 | 47.4 MB | ####5 | 45% 
 2025-05-07T19:45:24.1364653Z 2025-05-07T19:45:24.1364658Z 2025-05-07T19:45:24.1496986Z python-3.9.22 | 22.5 MB | ######8 | 69%  2025-05-07T19:45:24.1497285Z 2025-05-07T19:45:24.1497409Z 2025-05-07T19:45:24.1497633Z 2025-05-07T19:45:24.1497639Z 2025-05-07T19:45:24.1497643Z 2025-05-07T19:45:24.1932105Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:24.1932431Z 2025-05-07T19:45:24.1932436Z 2025-05-07T19:45:24.1932439Z 2025-05-07T19:45:24.1932443Z 2025-05-07T19:45:24.1932447Z 2025-05-07T19:45:24.1932451Z 2025-05-07T19:45:24.2037179Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:24.2266029Z openjdk-23.0.1 | 181.3 MB | #9 | 20% 2025-05-07T19:45:24.2266292Z 2025-05-07T19:45:24.2266303Z 2025-05-07T19:45:24.2269252Z 2025-05-07T19:45:24.2280001Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:24.2280486Z 2025-05-07T19:45:24.2363673Z bazel-7.5.0 | 47.4 MB | #####6 | 57%  2025-05-07T19:45:24.2363965Z 2025-05-07T19:45:24.2363970Z 2025-05-07T19:45:24.2935482Z python-3.9.22 | 22.5 MB | #########1 | 91%  2025-05-07T19:45:24.2935779Z 2025-05-07T19:45:24.2935801Z 2025-05-07T19:45:24.2935804Z 2025-05-07T19:45:24.2935809Z 2025-05-07T19:45:24.2935813Z 2025-05-07T19:45:24.2935832Z 2025-05-07T19:45:24.2960774Z libopenblas-0.3.29 | 5.6 MB | #######6 | 76%  2025-05-07T19:45:24.2961115Z 2025-05-07T19:45:24.2961120Z 2025-05-07T19:45:24.2961124Z 2025-05-07T19:45:24.2961127Z 2025-05-07T19:45:24.2961130Z 2025-05-07T19:45:24.2961134Z 2025-05-07T19:45:24.2963530Z 2025-05-07T19:45:24.3472123Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:24.3496051Z openjdk-23.0.1 | 181.3 MB | ##3 | 23% 2025-05-07T19:45:24.3496498Z 2025-05-07T19:45:24.4312321Z bazel-7.5.0 | 47.4 MB | ######8 | 68%  2025-05-07T19:45:24.4312607Z 2025-05-07T19:45:24.4312612Z 2025-05-07T19:45:24.4312616Z 2025-05-07T19:45:24.4312619Z 2025-05-07T19:45:24.4312626Z 2025-05-07T19:45:24.4312630Z 2025-05-07T19:45:24.4472325Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:24.4497926Z 
openjdk-23.0.1 | 181.3 MB | ##6 | 27% 2025-05-07T19:45:24.4498718Z 2025-05-07T19:45:24.4641659Z bazel-7.5.0 | 47.4 MB | ########2 | 82%  2025-05-07T19:45:24.4642069Z 2025-05-07T19:45:24.4642089Z 2025-05-07T19:45:24.4642094Z 2025-05-07T19:45:24.4642098Z 2025-05-07T19:45:24.4642103Z 2025-05-07T19:45:24.4642109Z 2025-05-07T19:45:24.4642113Z 2025-05-07T19:45:24.4642596Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:24.4643160Z 2025-05-07T19:45:24.4643164Z 2025-05-07T19:45:24.4643167Z 2025-05-07T19:45:24.4643171Z 2025-05-07T19:45:24.4643174Z 2025-05-07T19:45:24.4643178Z 2025-05-07T19:45:24.4643187Z 2025-05-07T19:45:24.4966654Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:24.4966996Z 2025-05-07T19:45:24.4967000Z 2025-05-07T19:45:24.4967005Z 2025-05-07T19:45:24.4967009Z 2025-05-07T19:45:24.4967014Z 2025-05-07T19:45:24.4967018Z 2025-05-07T19:45:24.4967043Z 2025-05-07T19:45:24.4967047Z 2025-05-07T19:45:24.5310342Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:24.5310692Z 2025-05-07T19:45:24.5310697Z 2025-05-07T19:45:24.5310701Z 2025-05-07T19:45:24.5310704Z 2025-05-07T19:45:24.5310708Z 2025-05-07T19:45:24.5310712Z 2025-05-07T19:45:24.5310716Z 2025-05-07T19:45:24.5310720Z 2025-05-07T19:45:24.5310723Z 2025-05-07T19:45:24.5477784Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:24.5766101Z openjdk-23.0.1 | 181.3 MB | ###1 | 31% 2025-05-07T19:45:24.5766540Z 2025-05-07T19:45:24.6458543Z bazel-7.5.0 | 47.4 MB | #########4 | 94%  2025-05-07T19:45:24.6458855Z 2025-05-07T19:45:24.6458859Z 2025-05-07T19:45:24.6458863Z 2025-05-07T19:45:24.6458866Z 2025-05-07T19:45:24.6458870Z 2025-05-07T19:45:24.6458875Z 2025-05-07T19:45:24.6458878Z 2025-05-07T19:45:24.6458882Z 2025-05-07T19:45:24.6459390Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:24.6459718Z 2025-05-07T19:45:24.6459721Z 2025-05-07T19:45:24.6459725Z 2025-05-07T19:45:24.6459728Z 2025-05-07T19:45:24.6459732Z 2025-05-07T19:45:24.6459735Z 2025-05-07T19:45:24.6459738Z 
2025-05-07T19:45:24.6459742Z 2025-05-07T19:45:24.6512641Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:24.6566719Z openjdk-23.0.1 | 181.3 MB | ###4 | 35% 2025-05-07T19:45:24.6567314Z 2025-05-07T19:45:24.6567415Z 2025-05-07T19:45:24.6567420Z 2025-05-07T19:45:24.6567424Z 2025-05-07T19:45:24.6567447Z 2025-05-07T19:45:24.6567451Z 2025-05-07T19:45:24.6567455Z 2025-05-07T19:45:24.6567458Z 2025-05-07T19:45:24.6567462Z 2025-05-07T19:45:24.6567805Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:24.6568114Z 2025-05-07T19:45:24.6568118Z 2025-05-07T19:45:24.6568122Z 2025-05-07T19:45:24.6568133Z 2025-05-07T19:45:24.6568161Z 2025-05-07T19:45:24.6568172Z 2025-05-07T19:45:24.6568175Z 2025-05-07T19:45:24.6568178Z 2025-05-07T19:45:24.6568182Z 2025-05-07T19:45:24.6857173Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:24.6857525Z 2025-05-07T19:45:24.6857530Z 2025-05-07T19:45:24.6857554Z 2025-05-07T19:45:24.6857558Z 2025-05-07T19:45:24.6857561Z 2025-05-07T19:45:24.6857565Z 2025-05-07T19:45:24.6857568Z 2025-05-07T19:45:24.6857585Z 2025-05-07T19:45:24.6857588Z 2025-05-07T19:45:24.6857592Z 2025-05-07T19:45:24.6869984Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:24.6870263Z 2025-05-07T19:45:24.6871237Z 2025-05-07T19:45:24.7282563Z python-3.9.22 | 22.5 MB | ########## | 100%  2025-05-07T19:45:24.7282863Z 2025-05-07T19:45:24.7282868Z 2025-05-07T19:45:24.7282872Z 2025-05-07T19:45:24.7282876Z 2025-05-07T19:45:24.7282881Z 2025-05-07T19:45:24.7282884Z 2025-05-07T19:45:24.7282908Z 2025-05-07T19:45:24.7282912Z 2025-05-07T19:45:24.7282935Z 2025-05-07T19:45:24.7282939Z 2025-05-07T19:45:24.7282943Z 2025-05-07T19:45:24.7364284Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:24.7364638Z 2025-05-07T19:45:24.7364671Z 2025-05-07T19:45:24.7364698Z 2025-05-07T19:45:24.7364706Z 2025-05-07T19:45:24.7364710Z 2025-05-07T19:45:24.7364713Z 2025-05-07T19:45:24.7364729Z 2025-05-07T19:45:24.7364837Z 2025-05-07T19:45:24.7365065Z 
2025-05-07T19:45:24.7365070Z 2025-05-07T19:45:24.7365075Z 2025-05-07T19:45:24.7365079Z 2025-05-07T19:45:24.7517424Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:24.7762623Z openjdk-23.0.1 | 181.3 MB | #### | 40% 2025-05-07T19:45:24.7762994Z 2025-05-07T19:45:24.7763035Z 2025-05-07T19:45:24.7763039Z 2025-05-07T19:45:24.7763052Z 2025-05-07T19:45:24.7763055Z 2025-05-07T19:45:24.7763077Z 2025-05-07T19:45:24.7763080Z 2025-05-07T19:45:24.7763084Z 2025-05-07T19:45:24.7763101Z 2025-05-07T19:45:24.7763132Z 2025-05-07T19:45:24.7821164Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:24.7821490Z 2025-05-07T19:45:24.7821495Z 2025-05-07T19:45:24.7821499Z 2025-05-07T19:45:24.7821503Z 2025-05-07T19:45:24.7821506Z 2025-05-07T19:45:24.7821511Z 2025-05-07T19:45:24.7821535Z 2025-05-07T19:45:24.7821539Z 2025-05-07T19:45:24.7821543Z 2025-05-07T19:45:24.7821562Z 2025-05-07T19:45:24.7821566Z 2025-05-07T19:45:24.7853985Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:24.7854342Z 2025-05-07T19:45:24.7854347Z 2025-05-07T19:45:24.7854372Z 2025-05-07T19:45:24.7854376Z 2025-05-07T19:45:24.7854380Z 2025-05-07T19:45:24.7854383Z 2025-05-07T19:45:24.7854386Z 2025-05-07T19:45:24.7854390Z 2025-05-07T19:45:24.7854393Z 2025-05-07T19:45:24.7854397Z 2025-05-07T19:45:24.7854400Z 2025-05-07T19:45:24.7854404Z 2025-05-07T19:45:24.8202071Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:24.8202450Z 2025-05-07T19:45:24.8202454Z 2025-05-07T19:45:24.8202458Z 2025-05-07T19:45:24.8202462Z 2025-05-07T19:45:24.8202465Z 2025-05-07T19:45:24.8202469Z 2025-05-07T19:45:24.8202472Z 2025-05-07T19:45:24.8202476Z 2025-05-07T19:45:24.8202479Z 2025-05-07T19:45:24.8202483Z 2025-05-07T19:45:24.8202486Z 2025-05-07T19:45:24.8202490Z 2025-05-07T19:45:24.8202493Z 2025-05-07T19:45:24.8202503Z 2025-05-07T19:45:24.8277218Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:24.8277548Z 2025-05-07T19:45:24.8277553Z 2025-05-07T19:45:24.8277556Z 2025-05-07T19:45:24.8277560Z 
2025-05-07T19:45:24.8277564Z 2025-05-07T19:45:24.8277567Z 2025-05-07T19:45:24.8277571Z 2025-05-07T19:45:24.8277574Z 2025-05-07T19:45:24.8277578Z 2025-05-07T19:45:24.8277581Z 2025-05-07T19:45:24.8277585Z 2025-05-07T19:45:24.8277588Z 2025-05-07T19:45:24.8277592Z 2025-05-07T19:45:24.8348121Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:24.8348480Z 2025-05-07T19:45:24.8348484Z 2025-05-07T19:45:24.8348488Z 2025-05-07T19:45:24.8348492Z 2025-05-07T19:45:24.8348495Z 2025-05-07T19:45:24.8348499Z 2025-05-07T19:45:24.8348503Z 2025-05-07T19:45:24.8348506Z 2025-05-07T19:45:24.8348510Z 2025-05-07T19:45:24.8348535Z 2025-05-07T19:45:24.8348538Z 2025-05-07T19:45:24.8348548Z 2025-05-07T19:45:24.8348552Z 2025-05-07T19:45:24.8348555Z 2025-05-07T19:45:24.8350286Z 2025-05-07T19:45:24.8519177Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:24.8657126Z openjdk-23.0.1 | 181.3 MB | ####4 | 45% 2025-05-07T19:45:24.8657585Z 2025-05-07T19:45:24.8657593Z 2025-05-07T19:45:24.8657599Z 2025-05-07T19:45:24.8657606Z 2025-05-07T19:45:24.8845241Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:24.8845562Z 2025-05-07T19:45:24.8845606Z 2025-05-07T19:45:24.8845610Z 2025-05-07T19:45:24.8845614Z 2025-05-07T19:45:24.8845617Z 2025-05-07T19:45:24.8845622Z 2025-05-07T19:45:24.8845625Z 2025-05-07T19:45:24.8845629Z 2025-05-07T19:45:24.8845633Z 2025-05-07T19:45:24.8845637Z 2025-05-07T19:45:24.8845641Z 2025-05-07T19:45:24.8845644Z 2025-05-07T19:45:24.8845648Z 2025-05-07T19:45:24.8845651Z 2025-05-07T19:45:24.8908807Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:24.8909385Z 2025-05-07T19:45:24.8909389Z 2025-05-07T19:45:24.8909393Z 2025-05-07T19:45:24.8909397Z 2025-05-07T19:45:24.8909400Z 2025-05-07T19:45:24.8909404Z 2025-05-07T19:45:24.8909407Z 2025-05-07T19:45:24.8909411Z 2025-05-07T19:45:24.8909414Z 2025-05-07T19:45:24.8909418Z 2025-05-07T19:45:24.8909421Z 2025-05-07T19:45:24.8909425Z 2025-05-07T19:45:24.8909428Z 2025-05-07T19:45:24.8909432Z 
2025-05-07T19:45:24.8909435Z 2025-05-07T19:45:24.8936786Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:24.8937165Z 2025-05-07T19:45:24.8937170Z 2025-05-07T19:45:24.8937174Z 2025-05-07T19:45:24.8937177Z 2025-05-07T19:45:24.8937181Z 2025-05-07T19:45:24.8937184Z 2025-05-07T19:45:24.8937188Z 2025-05-07T19:45:24.8937191Z 2025-05-07T19:45:24.8937195Z 2025-05-07T19:45:24.8937198Z 2025-05-07T19:45:24.8937202Z 2025-05-07T19:45:24.8937229Z 2025-05-07T19:45:24.8937232Z 2025-05-07T19:45:24.9302682Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:24.9303050Z 2025-05-07T19:45:24.9303055Z 2025-05-07T19:45:24.9303059Z 2025-05-07T19:45:24.9303062Z 2025-05-07T19:45:24.9303066Z 2025-05-07T19:45:24.9303093Z 2025-05-07T19:45:24.9303096Z 2025-05-07T19:45:24.9303100Z 2025-05-07T19:45:24.9303103Z 2025-05-07T19:45:24.9303107Z 2025-05-07T19:45:24.9303110Z 2025-05-07T19:45:24.9303113Z 2025-05-07T19:45:24.9303117Z 2025-05-07T19:45:24.9303120Z 2025-05-07T19:45:24.9303124Z 2025-05-07T19:45:24.9303325Z 2025-05-07T19:45:24.9421824Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:24.9422198Z 2025-05-07T19:45:24.9422203Z 2025-05-07T19:45:24.9422207Z 2025-05-07T19:45:24.9422210Z 2025-05-07T19:45:24.9422214Z 2025-05-07T19:45:24.9422217Z 2025-05-07T19:45:24.9422221Z 2025-05-07T19:45:24.9422224Z 2025-05-07T19:45:24.9422228Z 2025-05-07T19:45:24.9422232Z 2025-05-07T19:45:24.9422249Z 2025-05-07T19:45:24.9422253Z 2025-05-07T19:45:24.9422257Z 2025-05-07T19:45:24.9422260Z 2025-05-07T19:45:24.9422263Z 2025-05-07T19:45:24.9422267Z 2025-05-07T19:45:24.9422270Z 2025-05-07T19:45:24.9477446Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:24.9477798Z 2025-05-07T19:45:24.9477802Z 2025-05-07T19:45:24.9477806Z 2025-05-07T19:45:24.9477809Z 2025-05-07T19:45:24.9477813Z 2025-05-07T19:45:24.9477816Z 2025-05-07T19:45:24.9477820Z 2025-05-07T19:45:24.9477823Z 2025-05-07T19:45:24.9477844Z 2025-05-07T19:45:24.9477875Z 2025-05-07T19:45:24.9477879Z 2025-05-07T19:45:24.9477882Z 
2025-05-07T19:45:24.9477886Z 2025-05-07T19:45:24.9477889Z 2025-05-07T19:45:24.9477892Z 2025-05-07T19:45:24.9477896Z 2025-05-07T19:45:24.9477899Z 2025-05-07T19:45:24.9477903Z 2025-05-07T19:45:24.9520015Z libsqlite-3.49.2 | 895 KB | 1 | 2%  2025-05-07T19:45:24.9678084Z openjdk-23.0.1 | 181.3 MB | ####9 | 50% 2025-05-07T19:45:24.9678372Z 2025-05-07T19:45:24.9678377Z 2025-05-07T19:45:24.9678381Z 2025-05-07T19:45:24.9678384Z 2025-05-07T19:45:24.9678388Z 2025-05-07T19:45:24.9681352Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:24.9681643Z 2025-05-07T19:45:24.9681647Z 2025-05-07T19:45:24.9681651Z 2025-05-07T19:45:24.9681654Z 2025-05-07T19:45:24.9681657Z 2025-05-07T19:45:24.9681661Z 2025-05-07T19:45:24.9681664Z 2025-05-07T19:45:24.9681668Z 2025-05-07T19:45:24.9681684Z 2025-05-07T19:45:24.9681687Z 2025-05-07T19:45:24.9681724Z 2025-05-07T19:45:24.9681728Z 2025-05-07T19:45:24.9681731Z 2025-05-07T19:45:24.9681735Z 2025-05-07T19:45:24.9681738Z 2025-05-07T19:45:24.9681742Z 2025-05-07T19:45:24.9764434Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:24.9764786Z 2025-05-07T19:45:24.9764813Z 2025-05-07T19:45:24.9764817Z 2025-05-07T19:45:24.9765090Z 2025-05-07T19:45:24.9765094Z 2025-05-07T19:45:24.9765098Z 2025-05-07T19:45:24.9765101Z 2025-05-07T19:45:24.9765105Z 2025-05-07T19:45:24.9765109Z 2025-05-07T19:45:24.9765112Z 2025-05-07T19:45:24.9765115Z 2025-05-07T19:45:24.9765119Z 2025-05-07T19:45:24.9765122Z 2025-05-07T19:45:24.9765126Z 2025-05-07T19:45:24.9765129Z 2025-05-07T19:45:24.9765133Z 2025-05-07T19:45:24.9765136Z 2025-05-07T19:45:24.9813679Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:24.9814070Z 2025-05-07T19:45:24.9814075Z 2025-05-07T19:45:24.9814079Z 2025-05-07T19:45:24.9814082Z 2025-05-07T19:45:24.9814086Z 2025-05-07T19:45:24.9814090Z 2025-05-07T19:45:24.9814093Z 2025-05-07T19:45:24.9814097Z 2025-05-07T19:45:24.9814100Z 2025-05-07T19:45:24.9814104Z 2025-05-07T19:45:24.9814107Z 2025-05-07T19:45:24.9814111Z 
2025-05-07T19:45:24.9814114Z 2025-05-07T19:45:24.9814118Z 2025-05-07T19:45:24.9814146Z 2025-05-07T19:45:24.9814156Z 2025-05-07T19:45:24.9814159Z 2025-05-07T19:45:24.9814163Z 2025-05-07T19:45:25.0240554Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:25.0240929Z 2025-05-07T19:45:25.0240934Z 2025-05-07T19:45:25.0240963Z 2025-05-07T19:45:25.0240967Z 2025-05-07T19:45:25.0240970Z 2025-05-07T19:45:25.0240974Z 2025-05-07T19:45:25.0240977Z 2025-05-07T19:45:25.0240981Z 2025-05-07T19:45:25.0240984Z 2025-05-07T19:45:25.0240987Z 2025-05-07T19:45:25.0240991Z 2025-05-07T19:45:25.0240994Z 2025-05-07T19:45:25.0241202Z 2025-05-07T19:45:25.0241207Z 2025-05-07T19:45:25.0241210Z 2025-05-07T19:45:25.0241214Z 2025-05-07T19:45:25.0241217Z 2025-05-07T19:45:25.0241221Z 2025-05-07T19:45:25.0241224Z 2025-05-07T19:45:25.0522621Z ... (more hidden) ... 2025-05-07T19:45:25.0560144Z openjdk-23.0.1 | 181.3 MB | #####4 | 54% 2025-05-07T19:45:25.0560809Z 2025-05-07T19:45:25.0560860Z 2025-05-07T19:45:25.0560865Z 2025-05-07T19:45:25.0560873Z 2025-05-07T19:45:25.0560878Z 2025-05-07T19:45:25.0560883Z 2025-05-07T19:45:25.0560888Z 2025-05-07T19:45:25.0560893Z 2025-05-07T19:45:25.0560899Z 2025-05-07T19:45:25.0560909Z 2025-05-07T19:45:25.0560916Z 2025-05-07T19:45:25.0560921Z 2025-05-07T19:45:25.0560926Z 2025-05-07T19:45:25.0560930Z 2025-05-07T19:45:25.0560934Z 2025-05-07T19:45:25.0560951Z 2025-05-07T19:45:25.0560955Z 2025-05-07T19:45:25.0560959Z 2025-05-07T19:45:25.0561093Z 2025-05-07T19:45:25.1536404Z ... (more hidden) ... 
2025-05-07T19:45:25.1680441Z openjdk-23.0.1 | 181.3 MB | #####8 | 59% 2025-05-07T19:45:25.1680731Z 2025-05-07T19:45:25.1680736Z 2025-05-07T19:45:25.1680739Z 2025-05-07T19:45:25.1680744Z 2025-05-07T19:45:25.1680748Z 2025-05-07T19:45:25.1680752Z 2025-05-07T19:45:25.1680757Z 2025-05-07T19:45:25.2538497Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:25.2555743Z openjdk-23.0.1 | 181.3 MB | ######4 | 64% 2025-05-07T19:45:25.2556047Z 2025-05-07T19:45:25.2556052Z 2025-05-07T19:45:25.2556056Z 2025-05-07T19:45:25.2556060Z 2025-05-07T19:45:25.2556065Z 2025-05-07T19:45:25.2556070Z 2025-05-07T19:45:25.2813954Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:25.3621781Z 2025-05-07T19:45:25.3622718Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:25.4634328Z openjdk-23.0.1 | 181.3 MB | ######9 | 69% 2025-05-07T19:45:25.5631753Z openjdk-23.0.1 | 181.3 MB | #######3 | 74% 2025-05-07T19:45:25.5882197Z openjdk-23.0.1 | 181.3 MB | #######8 | 79% 2025-05-07T19:45:25.5882475Z 2025-05-07T19:45:25.5882480Z 2025-05-07T19:45:25.5882484Z 2025-05-07T19:45:25.5882487Z 2025-05-07T19:45:25.5882491Z 2025-05-07T19:45:25.5882496Z 2025-05-07T19:45:25.5882525Z 2025-05-07T19:45:25.5882529Z 2025-05-07T19:45:25.6681165Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:25.7681807Z openjdk-23.0.1 | 181.3 MB | ########3 | 84% 2025-05-07T19:45:25.8045548Z openjdk-23.0.1 | 181.3 MB | ########9 | 89% 2025-05-07T19:45:25.8045919Z 2025-05-07T19:45:25.8046218Z 2025-05-07T19:45:25.8046234Z 2025-05-07T19:45:25.8046241Z 2025-05-07T19:45:25.8046247Z 2025-05-07T19:45:25.8046253Z 2025-05-07T19:45:25.8046258Z 2025-05-07T19:45:25.8046263Z 2025-05-07T19:45:25.8046267Z 2025-05-07T19:45:25.8682007Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:26.2528571Z openjdk-23.0.1 | 181.3 MB | #########4 | 95% 2025-05-07T19:45:26.2529431Z 2025-05-07T19:45:26.2529447Z 2025-05-07T19:45:26.2529459Z 2025-05-07T19:45:26.2529499Z 2025-05-07T19:45:26.2529513Z 
2025-05-07T19:45:26.2529527Z 2025-05-07T19:45:26.2529542Z 2025-05-07T19:45:26.2529556Z 2025-05-07T19:45:26.2529570Z 2025-05-07T19:45:26.2529582Z 2025-05-07T19:45:26.2530475Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:26.2531316Z 2025-05-07T19:45:26.2531329Z 2025-05-07T19:45:26.2531341Z 2025-05-07T19:45:26.2531384Z 2025-05-07T19:45:26.2531396Z 2025-05-07T19:45:26.2531409Z 2025-05-07T19:45:26.2531420Z 2025-05-07T19:45:26.2531432Z 2025-05-07T19:45:26.2531444Z 2025-05-07T19:45:26.2531455Z 2025-05-07T19:45:26.3146038Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:26.3146964Z 2025-05-07T19:45:26.3146979Z 2025-05-07T19:45:26.3146991Z 2025-05-07T19:45:26.3147491Z 2025-05-07T19:45:26.3147505Z 2025-05-07T19:45:26.3147515Z 2025-05-07T19:45:26.3147525Z 2025-05-07T19:45:26.3147536Z 2025-05-07T19:45:26.3147546Z 2025-05-07T19:45:26.3147556Z 2025-05-07T19:45:26.3147566Z 2025-05-07T19:45:26.3148467Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.3149448Z 2025-05-07T19:45:26.3149458Z 2025-05-07T19:45:26.3149469Z 2025-05-07T19:45:26.3149496Z 2025-05-07T19:45:26.3149507Z 2025-05-07T19:45:26.3149517Z 2025-05-07T19:45:26.3149527Z 2025-05-07T19:45:26.3149538Z 2025-05-07T19:45:26.3149549Z 2025-05-07T19:45:26.3149559Z 2025-05-07T19:45:26.3149569Z 2025-05-07T19:45:26.3799737Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.3800121Z 2025-05-07T19:45:26.3800126Z 2025-05-07T19:45:26.3800130Z 2025-05-07T19:45:26.3800133Z 2025-05-07T19:45:26.3800137Z 2025-05-07T19:45:26.3800141Z 2025-05-07T19:45:26.3800144Z 2025-05-07T19:45:26.3800165Z 2025-05-07T19:45:26.3800169Z 2025-05-07T19:45:26.3800173Z 2025-05-07T19:45:26.3800176Z 2025-05-07T19:45:26.3800180Z 2025-05-07T19:45:26.3800498Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.3800820Z 2025-05-07T19:45:26.3800824Z 2025-05-07T19:45:26.3800828Z 2025-05-07T19:45:26.3800831Z 2025-05-07T19:45:26.3800835Z 2025-05-07T19:45:26.3800838Z 2025-05-07T19:45:26.3800853Z 
2025-05-07T19:45:26.3800856Z 2025-05-07T19:45:26.3800860Z 2025-05-07T19:45:26.3800863Z 2025-05-07T19:45:26.3800898Z 2025-05-07T19:45:26.3800901Z 2025-05-07T19:45:26.5054032Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.5054390Z 2025-05-07T19:45:26.5054395Z 2025-05-07T19:45:26.5054398Z 2025-05-07T19:45:26.5054402Z 2025-05-07T19:45:26.5054406Z 2025-05-07T19:45:26.5054444Z 2025-05-07T19:45:26.5054448Z 2025-05-07T19:45:26.5054452Z 2025-05-07T19:45:26.5054456Z 2025-05-07T19:45:26.5054477Z 2025-05-07T19:45:26.5054481Z 2025-05-07T19:45:26.5054484Z 2025-05-07T19:45:26.5054488Z 2025-05-07T19:45:26.5054491Z 2025-05-07T19:45:26.5054789Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.5055121Z 2025-05-07T19:45:26.5055124Z 2025-05-07T19:45:26.5055128Z 2025-05-07T19:45:26.5055132Z 2025-05-07T19:45:26.5055135Z 2025-05-07T19:45:26.5055138Z 2025-05-07T19:45:26.5055451Z 2025-05-07T19:45:26.5055456Z 2025-05-07T19:45:26.5055463Z 2025-05-07T19:45:26.5055466Z 2025-05-07T19:45:26.5055470Z 2025-05-07T19:45:26.5055475Z 2025-05-07T19:45:26.5055478Z 2025-05-07T19:45:26.5055488Z 2025-05-07T19:45:26.9957988Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.9958342Z 2025-05-07T19:45:26.9958347Z 2025-05-07T19:45:26.9958351Z 2025-05-07T19:45:26.9958355Z 2025-05-07T19:45:26.9958359Z 2025-05-07T19:45:26.9958362Z 2025-05-07T19:45:26.9958366Z 2025-05-07T19:45:26.9958389Z 2025-05-07T19:45:26.9958424Z 2025-05-07T19:45:26.9958428Z 2025-05-07T19:45:26.9958431Z 2025-05-07T19:45:26.9958435Z 2025-05-07T19:45:26.9958439Z 2025-05-07T19:45:26.9958442Z 2025-05-07T19:45:26.9958446Z 2025-05-07T19:45:26.9958787Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.9959139Z 2025-05-07T19:45:26.9959144Z 2025-05-07T19:45:26.9959176Z 2025-05-07T19:45:26.9959190Z 2025-05-07T19:45:26.9959193Z 2025-05-07T19:45:26.9959196Z 2025-05-07T19:45:26.9959200Z 2025-05-07T19:45:26.9959203Z 2025-05-07T19:45:26.9959207Z 2025-05-07T19:45:26.9959210Z 
2025-05-07T19:45:26.9959214Z 2025-05-07T19:45:26.9959217Z 2025-05-07T19:45:26.9959221Z 2025-05-07T19:45:26.9959224Z 2025-05-07T19:45:26.9959228Z 2025-05-07T19:45:27.0803893Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:27.0804337Z 2025-05-07T19:45:27.0804342Z 2025-05-07T19:45:27.0804346Z 2025-05-07T19:45:27.0804636Z 2025-05-07T19:45:27.0804642Z 2025-05-07T19:45:27.0804645Z 2025-05-07T19:45:27.0804649Z 2025-05-07T19:45:27.0804652Z 2025-05-07T19:45:27.0804656Z 2025-05-07T19:45:27.0804660Z 2025-05-07T19:45:27.0804664Z 2025-05-07T19:45:27.0804667Z 2025-05-07T19:45:27.0804671Z 2025-05-07T19:45:27.0805042Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:27.0805380Z 2025-05-07T19:45:27.0805409Z 2025-05-07T19:45:27.0805413Z 2025-05-07T19:45:27.0805416Z 2025-05-07T19:45:27.0805420Z 2025-05-07T19:45:27.0805423Z 2025-05-07T19:45:27.0805427Z 2025-05-07T19:45:27.0805430Z 2025-05-07T19:45:27.0805434Z 2025-05-07T19:45:27.0805437Z 2025-05-07T19:45:27.0805441Z 2025-05-07T19:45:27.0805445Z 2025-05-07T19:45:27.0805450Z 2025-05-07T19:45:27.1533600Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:27.1533987Z 2025-05-07T19:45:27.1533992Z 2025-05-07T19:45:27.1533996Z 2025-05-07T19:45:27.1534019Z 2025-05-07T19:45:27.1534023Z 2025-05-07T19:45:27.1534053Z 2025-05-07T19:45:27.1534056Z 2025-05-07T19:45:27.1534060Z 2025-05-07T19:45:27.1534064Z 2025-05-07T19:45:27.1534067Z 2025-05-07T19:45:27.1534071Z 2025-05-07T19:45:27.1534075Z 2025-05-07T19:45:27.1534078Z 2025-05-07T19:45:27.1534082Z 2025-05-07T19:45:27.1534085Z 2025-05-07T19:45:27.1534088Z 2025-05-07T19:45:27.1534397Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:27.1534927Z 2025-05-07T19:45:27.1534931Z 2025-05-07T19:45:27.1534934Z 2025-05-07T19:45:27.1534938Z 2025-05-07T19:45:27.1534941Z 2025-05-07T19:45:27.1534945Z 2025-05-07T19:45:27.1534948Z 2025-05-07T19:45:27.1534951Z 2025-05-07T19:45:27.1534955Z 2025-05-07T19:45:27.1534958Z 2025-05-07T19:45:27.1534961Z 
2025-05-07T19:45:27.1534965Z 2025-05-07T19:45:27.1534968Z 2025-05-07T19:45:27.1534971Z 2025-05-07T19:45:27.1534975Z 2025-05-07T19:45:27.1534978Z 2025-05-07T19:45:27.3138427Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:27.3139440Z 2025-05-07T19:45:27.3139454Z 2025-05-07T19:45:27.3139465Z 2025-05-07T19:45:27.3139477Z 2025-05-07T19:45:27.3139488Z 2025-05-07T19:45:27.3139498Z 2025-05-07T19:45:27.3139508Z 2025-05-07T19:45:27.3139518Z 2025-05-07T19:45:27.3139529Z 2025-05-07T19:45:27.3139539Z 2025-05-07T19:45:27.3139549Z 2025-05-07T19:45:27.3141444Z 2025-05-07T19:45:27.3141455Z 2025-05-07T19:45:27.3141465Z 2025-05-07T19:45:27.3141476Z 2025-05-07T19:45:27.3141486Z 2025-05-07T19:45:27.3141496Z 2025-05-07T19:45:27.3142613Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:27.3142942Z 2025-05-07T19:45:27.3142946Z 2025-05-07T19:45:27.3142950Z 2025-05-07T19:45:27.3142953Z 2025-05-07T19:45:27.3142982Z 2025-05-07T19:45:27.3142985Z 2025-05-07T19:45:27.3142989Z 2025-05-07T19:45:27.3142993Z 2025-05-07T19:45:27.3142996Z 2025-05-07T19:45:27.3143007Z 2025-05-07T19:45:27.3143010Z 2025-05-07T19:45:27.3143014Z 2025-05-07T19:45:27.3143018Z 2025-05-07T19:45:27.3143022Z 2025-05-07T19:45:27.3143025Z 2025-05-07T19:45:27.3143029Z 2025-05-07T19:45:27.3143032Z 2025-05-07T19:45:27.3416640Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:27.3417000Z 2025-05-07T19:45:27.3417005Z 2025-05-07T19:45:27.3417021Z 2025-05-07T19:45:27.3417025Z 2025-05-07T19:45:27.3417029Z 2025-05-07T19:45:27.3417033Z 2025-05-07T19:45:27.3417036Z 2025-05-07T19:45:27.3417040Z 2025-05-07T19:45:27.3417044Z 2025-05-07T19:45:27.3417047Z 2025-05-07T19:45:27.3417051Z 2025-05-07T19:45:27.3417054Z 2025-05-07T19:45:27.3417058Z 2025-05-07T19:45:27.3417061Z 2025-05-07T19:45:27.3417065Z 2025-05-07T19:45:27.3417069Z 2025-05-07T19:45:27.3417072Z 2025-05-07T19:45:27.3417076Z 2025-05-07T19:45:27.3417717Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:27.3418088Z 
2025-05-07T19:45:27.3418091Z 2025-05-07T19:45:27.3418095Z 2025-05-07T19:45:27.3418098Z 2025-05-07T19:45:27.3418102Z 2025-05-07T19:45:27.3418106Z 2025-05-07T19:45:27.3418109Z 2025-05-07T19:45:27.3418113Z 2025-05-07T19:45:27.3418118Z 2025-05-07T19:45:27.3418121Z 2025-05-07T19:45:27.3418153Z 2025-05-07T19:45:27.3418156Z 2025-05-07T19:45:27.3418159Z 2025-05-07T19:45:27.3418163Z 2025-05-07T19:45:27.3418171Z 2025-05-07T19:45:27.3418174Z 2025-05-07T19:45:27.3418178Z 2025-05-07T19:45:27.3418181Z 2025-05-07T19:45:27.4619788Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:27.4621109Z 2025-05-07T19:45:27.4621123Z 2025-05-07T19:45:27.6463277Z python-3.9.22 | 22.5 MB | ########## | 100%  2025-05-07T19:45:27.6463585Z 2025-05-07T19:45:27.6463590Z 2025-05-07T19:45:27.6463594Z 2025-05-07T19:45:27.7587645Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:28.5667389Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:28.5668165Z 2025-05-07T19:45:28.5668181Z 2025-05-07T19:45:28.5668193Z 2025-05-07T19:45:28.5668204Z 2025-05-07T19:45:28.5668214Z 2025-05-07T19:45:28.5668225Z 2025-05-07T19:45:28.5668235Z 2025-05-07T19:45:28.5668246Z 2025-05-07T19:45:28.5668288Z 2025-05-07T19:45:28.5668299Z 2025-05-07T19:45:28.5668309Z 2025-05-07T19:45:28.5668319Z 2025-05-07T19:45:28.5668346Z 2025-05-07T19:45:28.5668357Z 2025-05-07T19:45:28.5668367Z 2025-05-07T19:45:28.5668377Z 2025-05-07T19:45:28.5668387Z 2025-05-07T19:45:28.5668397Z 2025-05-07T19:45:28.5668407Z 2025-05-07T19:45:28.5669327Z ... (more hidden) ... 
2025-05-07T19:45:28.9605703Z ... (more hidden) ... 2025-05-07T19:45:29.9490465Z bazel-7.5.0 | 47.4 MB | ########## | 100% 2025-05-07T19:45:29.9496249Z openjdk-23.0.1 | 181.3 MB | ########## | 100%
2025-05-07T19:45:29.9536281Z done 2025-05-07T19:45:30.2672820Z Preparing transaction: | / - done 2025-05-07T19:45:33.7823757Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:45:36.2038955Z Executing
transaction: - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:45:36.6198207Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:38.2891559Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:38.2892183Z 2025-05-07T19:45:38.2901917Z 2025-05-07T19:45:38.2927534Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:40.5894050Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:40.5896421Z 2025-05-07T19:45:40.5896534Z Collecting build 2025-05-07T19:45:40.5896874Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:40.5897682Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build) (25.0) 2025-05-07T19:45:40.5898354Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:40.5898782Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:40.5899227Z Collecting importlib-metadata>=4.6 (from build) 2025-05-07T19:45:40.5899692Z Downloading importlib_metadata-8.7.0-py3-none-any.whl.metadata (4.8 kB) 2025-05-07T19:45:40.5900818Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build) (2.2.1) 2025-05-07T19:45:40.5901596Z Collecting zipp>=3.20 (from importlib-metadata>=4.6->build) 2025-05-07T19:45:40.5902091Z Downloading zipp-3.21.0-py3-none-any.whl.metadata (3.7 kB) 2025-05-07T19:45:40.5902547Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 
2025-05-07T19:45:40.5903047Z Downloading importlib_metadata-8.7.0-py3-none-any.whl (27 kB) 2025-05-07T19:45:40.5903518Z Downloading zipp-3.21.0-py3-none-any.whl (9.6 kB) 2025-05-07T19:45:40.5903938Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:40.5904501Z Installing collected packages: zipp, pyproject_hooks, importlib-metadata, build 2025-05-07T19:45:40.5904881Z 2025-05-07T19:45:40.5905229Z Successfully installed build-1.2.2.post1 importlib-metadata-8.7.0 pyproject_hooks-1.2.0 zipp-3.21.0 2025-05-07T19:45:40.5905704Z 2025-05-07T19:45:42.2343095Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:42.2343400Z 2025-05-07T19:45:42.2906615Z [CHECK] Binary make found in PATH 2025-05-07T19:45:43.8699437Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:43.8699864Z 2025-05-07T19:45:43.9269000Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:45.5007460Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:45.5007805Z 2025-05-07T19:45:45.5567848Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:47.2200730Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:49.0151454Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:50.7063063Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:52.4984261Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:54.1483957Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:54.1487682Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:54.1562158Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:54.1562647Z . 
$PRELUDE; install_cuda $BUILD_ENV 12.6.3 2025-05-07T19:45:54.1563341Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:54.1563726Z env: 2025-05-07T19:45:54.1563974Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:54.1564329Z BUILD_ENV: build_binary 2025-05-07T19:45:54.1564592Z BUILD_TARGET: default 2025-05-07T19:45:54.1564869Z BUILD_VARIANT: cuda 2025-05-07T19:45:54.1565118Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:45:54.1565439Z ##[endgroup] 2025-05-07T19:45:54.6131029Z ################################################################################ 2025-05-07T19:45:54.6131431Z # Install CUDA 2025-05-07T19:45:54.6131660Z # 2025-05-07T19:45:54.6145621Z # [2025-05-07T19:45:54.614Z] + install_cuda build_binary 12.6.3 2025-05-07T19:45:54.6146039Z ################################################################################ 2025-05-07T19:45:54.6146343Z 2025-05-07T19:45:54.6171256Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:54.7108142Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:54.7109196Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:54.7111787Z + conda clean --packages --tarball -y 2025-05-07T19:45:54.7112386Z 2025-05-07T19:45:55.2747745Z Will remove 148 (613.1 MB) tarball(s). 2025-05-07T19:45:55.2748776Z Will remove 21 (76.2 MB) package(s). 2025-05-07T19:45:55.3346326Z 2025-05-07T19:45:55.3351441Z + conda clean --all -y 2025-05-07T19:45:55.3351957Z 2025-05-07T19:45:55.9611391Z There are no unused tarball(s) to remove. 2025-05-07T19:45:55.9612617Z Will remove 1 index cache(s). 2025-05-07T19:45:55.9613480Z There are no unused package(s) to remove. 2025-05-07T19:45:55.9614400Z There are no tempfile(s) to remove. 2025-05-07T19:45:55.9615378Z There are no logfile(s) to remove. 2025-05-07T19:45:56.0172141Z 2025-05-07T19:45:56.0180382Z [INSTALL] Installing CUDA 12.6.3 ... 
2025-05-07T19:45:56.0207798Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c conda-forge --override-channels -y cuda=12.6.3 2025-05-07T19:45:56.8710924Z Channels: 2025-05-07T19:45:56.8711611Z - conda-forge 2025-05-07T19:45:56.8712250Z Platform: linux-64 2025-05-07T19:46:06.7494555Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:46:08.2698148Z Solving environment: | / - \ done 2025-05-07T19:46:08.4067835Z 2025-05-07T19:46:08.4068438Z ## Package Plan ## 2025-05-07T19:46:08.4069076Z 2025-05-07T19:46:08.4069712Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:08.4070618Z 2025-05-07T19:46:08.4070890Z added / updated specs: 2025-05-07T19:46:08.4071616Z - cuda=12.6.3 2025-05-07T19:46:08.4071995Z 2025-05-07T19:46:08.4072008Z 2025-05-07T19:46:08.4072355Z The following packages will be downloaded: 2025-05-07T19:46:08.4073018Z 2025-05-07T19:46:08.4073357Z package | build 2025-05-07T19:46:08.4074315Z ---------------------------|----------------- 2025-05-07T19:46:08.4075358Z attr-2.5.1 | h166bdaf_1 69 KB conda-forge 2025-05-07T19:46:08.4077012Z binutils-2.40 | h4852527_7 31 KB conda-forge 2025-05-07T19:46:08.4078291Z c-compiler-1.5.2 | h0b41bf4_0 6 KB conda-forge 2025-05-07T19:46:08.4079528Z cuda-12.6.3 | ha804496_0 26 KB conda-forge 2025-05-07T19:46:08.4080134Z cuda-cccl_linux-64-12.6.77 | ha770c72_0 1.0 MB conda-forge 2025-05-07T19:46:08.4080968Z cuda-command-line-tools-12.6.3| ha770c72_0 20 KB conda-forge 2025-05-07T19:46:08.4081508Z cuda-compiler-12.6.3 | hbad6d8a_0 20 KB conda-forge 2025-05-07T19:46:08.4082005Z cuda-crt-dev_linux-64-12.6.85| ha770c72_0 87 KB conda-forge 2025-05-07T19:46:08.4082851Z cuda-crt-tools-12.6.85 | ha770c72_0 26 KB conda-forge 2025-05-07T19:46:08.4083318Z cuda-cudart-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.4083807Z cuda-cudart-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.4084333Z 
cuda-cudart-dev_linux-64-12.6.77| h3f2d84a_0 357 KB conda-forge 2025-05-07T19:46:08.4084858Z cuda-cudart-static-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.4085412Z cuda-cudart-static_linux-64-12.6.77| h3f2d84a_0 744 KB conda-forge 2025-05-07T19:46:08.4104451Z cuda-cudart_linux-64-12.6.77| h3f2d84a_0 184 KB conda-forge 2025-05-07T19:46:08.4105111Z cuda-cuobjdump-12.6.77 | hbd13f7d_1 241 KB conda-forge 2025-05-07T19:46:08.4105629Z cuda-cupti-12.6.80 | hbd13f7d_0 1.9 MB conda-forge 2025-05-07T19:46:08.4106133Z cuda-cupti-dev-12.6.80 | h5888daf_0 3.4 MB conda-forge 2025-05-07T19:46:08.4106641Z cuda-cuxxfilt-12.6.77 | hbd13f7d_1 211 KB conda-forge 2025-05-07T19:46:08.4107260Z cuda-driver-dev-12.6.77 | h5888daf_0 22 KB conda-forge 2025-05-07T19:46:08.4107770Z cuda-driver-dev_linux-64-12.6.77| h3f2d84a_0 35 KB conda-forge 2025-05-07T19:46:08.4108268Z cuda-gdb-12.6.77 | h50b4baa_1 370 KB conda-forge 2025-05-07T19:46:08.4108722Z cuda-libraries-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:08.4109229Z cuda-libraries-dev-12.6.3 | ha770c72_0 20 KB conda-forge 2025-05-07T19:46:08.4109743Z cuda-nsight-12.6.77 | h7938cbb_0 113.2 MB conda-forge 2025-05-07T19:46:08.4110189Z cuda-nvcc-12.6.85 | hcdd1206_0 23 KB conda-forge 2025-05-07T19:46:08.4110687Z cuda-nvcc-dev_linux-64-12.6.85| he91c749_0 10.8 MB conda-forge 2025-05-07T19:46:08.4111188Z cuda-nvcc-impl-12.6.85 | h85509e4_0 25 KB conda-forge 2025-05-07T19:46:08.4111792Z cuda-nvcc-tools-12.6.85 | he02047a_0 23.0 MB conda-forge 2025-05-07T19:46:08.4112249Z cuda-nvcc_linux-64-12.6.85 | h04802cd_0 25 KB conda-forge 2025-05-07T19:46:08.4112721Z cuda-nvdisasm-12.6.77 | hbd13f7d_1 47.6 MB conda-forge 2025-05-07T19:46:08.4113189Z cuda-nvml-dev-12.6.77 | hbd13f7d_1 159 KB conda-forge 2025-05-07T19:46:08.4113631Z cuda-nvprof-12.6.80 | hbd13f7d_0 2.6 MB conda-forge 2025-05-07T19:46:08.4114078Z cuda-nvprune-12.6.77 | hbd13f7d_1 66 KB conda-forge 2025-05-07T19:46:08.4114514Z cuda-nvrtc-12.6.85 | hbd13f7d_0 17.3 MB 
conda-forge 2025-05-07T19:46:08.4114973Z cuda-nvrtc-dev-12.6.85 | h5888daf_0 31 KB conda-forge 2025-05-07T19:46:08.4115429Z cuda-nvtx-12.6.77 | hbd13f7d_0 31 KB conda-forge 2025-05-07T19:46:08.4115883Z cuda-nvvm-dev_linux-64-12.6.85| ha770c72_0 25 KB conda-forge 2025-05-07T19:46:08.4116365Z cuda-nvvm-impl-12.6.85 | he02047a_0 7.7 MB conda-forge 2025-05-07T19:46:08.4116811Z cuda-nvvm-tools-12.6.85 | he02047a_0 10.4 MB conda-forge 2025-05-07T19:46:08.4117260Z cuda-nvvp-12.6.80 | hbd13f7d_1 109.3 MB conda-forge 2025-05-07T19:46:08.4117681Z cuda-opencl-12.6.77 | hbd13f7d_0 29 KB conda-forge 2025-05-07T19:46:08.4118139Z cuda-opencl-dev-12.6.77 | h5888daf_0 93 KB conda-forge 2025-05-07T19:46:08.4119269Z cuda-profiler-api-12.6.77 | h7938cbb_0 22 KB conda-forge 2025-05-07T19:46:08.4119728Z cuda-runtime-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:08.4120208Z cuda-sanitizer-api-12.6.77 | hbd13f7d_1 8.9 MB conda-forge 2025-05-07T19:46:08.4120775Z cuda-toolkit-12.6.3 | ha804496_0 19 KB conda-forge 2025-05-07T19:46:08.4121210Z cuda-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:08.4121632Z cuda-version-12.6 | h7480c83_3 20 KB conda-forge 2025-05-07T19:46:08.4122096Z cuda-visual-tools-12.6.3 | ha770c72_0 19 KB conda-forge 2025-05-07T19:46:08.4122564Z cxx-compiler-1.5.2 | hf52228f_0 6 KB conda-forge 2025-05-07T19:46:08.4122967Z dbus-1.13.6 | h5008d03_3 604 KB conda-forge 2025-05-07T19:46:08.4123367Z expat-2.7.0 | h5888daf_0 137 KB conda-forge 2025-05-07T19:46:08.4123741Z gcc-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:08.4124154Z gds-tools-1.11.1.6 | h5888daf_4 37.8 MB conda-forge 2025-05-07T19:46:08.4124563Z gmp-6.3.0 | hac33072_2 449 KB conda-forge 2025-05-07T19:46:08.4124931Z gxx-11.4.0 | h602e360_13 49 KB conda-forge 2025-05-07T19:46:08.4125329Z libcap-2.75 | h39aace5_0 118 KB conda-forge 2025-05-07T19:46:08.4125739Z libcublas-12.6.4.1 | h5888daf_1 256.2 MB conda-forge 2025-05-07T19:46:08.4126184Z libcublas-dev-12.6.4.1 | h5888daf_1 88 KB 
conda-forge 2025-05-07T19:46:08.4126611Z libcufft-11.3.0.4 | hbd13f7d_0 156.2 MB conda-forge 2025-05-07T19:46:08.4127059Z libcufft-dev-11.3.0.4 | h5888daf_0 33 KB conda-forge 2025-05-07T19:46:08.4127486Z libcufile-1.11.1.6 | h12f29b5_4 900 KB conda-forge 2025-05-07T19:46:08.4127939Z libcufile-dev-1.11.1.6 | h5888daf_4 35 KB conda-forge 2025-05-07T19:46:08.4128389Z libcurand-10.3.7.77 | hbd13f7d_0 39.9 MB conda-forge 2025-05-07T19:46:08.4128827Z libcurand-dev-10.3.7.77 | h5888daf_0 262 KB conda-forge 2025-05-07T19:46:08.4129285Z libcusolver-11.7.1.2 | h5888daf_1 95.8 MB conda-forge 2025-05-07T19:46:08.4129733Z libcusolver-dev-11.7.1.2 | h5888daf_1 59 KB conda-forge 2025-05-07T19:46:08.4130197Z libcusparse-12.5.4.2 | hbd13f7d_0 118.6 MB conda-forge 2025-05-07T19:46:08.4130646Z libcusparse-dev-12.5.4.2 | h5888daf_0 51 KB conda-forge 2025-05-07T19:46:08.4131119Z libgcrypt-lib-1.11.0 | hb9d3cd8_2 572 KB conda-forge 2025-05-07T19:46:08.4131576Z libgpg-error-1.55 | h3f2d84a_0 305 KB conda-forge 2025-05-07T19:46:08.4131983Z libnl-3.11.0 | hb9d3cd8_0 724 KB conda-forge 2025-05-07T19:46:08.4132396Z libnpp-12.3.1.54 | h5888daf_0 93.4 MB conda-forge 2025-05-07T19:46:08.4132810Z libnpp-dev-12.3.1.54 | h5888daf_0 441 KB conda-forge 2025-05-07T19:46:08.4133246Z libnuma-2.0.18 | h4ab18f5_2 42 KB conda-forge 2025-05-07T19:46:08.4133664Z libnvfatbin-12.6.77 | hbd13f7d_0 783 KB conda-forge 2025-05-07T19:46:08.4134125Z libnvfatbin-dev-12.6.77 | h5888daf_0 26 KB conda-forge 2025-05-07T19:46:08.4134590Z libnvjitlink-12.6.85 | hbd13f7d_0 14.9 MB conda-forge 2025-05-07T19:46:08.4135044Z libnvjitlink-dev-12.6.85 | h5888daf_0 25 KB conda-forge 2025-05-07T19:46:08.4135503Z libnvjpeg-12.3.3.54 | h5888daf_0 2.4 MB conda-forge 2025-05-07T19:46:08.4136103Z libnvjpeg-dev-12.3.3.54 | ha770c72_0 31 KB conda-forge 2025-05-07T19:46:08.4136544Z libsystemd0-257.4 | h4e0b6ca_1 477 KB conda-forge 2025-05-07T19:46:08.4136988Z libudev1-257.4 | hbe16f8c_1 141 KB conda-forge 2025-05-07T19:46:08.4137501Z 
libxkbcommon-1.7.0 | h2c5496b_1 579 KB conda-forge 2025-05-07T19:46:08.4137931Z libxkbfile-1.1.0 | h166bdaf_1 111 KB conda-forge 2025-05-07T19:46:08.4138343Z lz4-c-1.10.0 | h5888daf_1 163 KB conda-forge 2025-05-07T19:46:08.4138773Z nsight-compute-2024.3.2.3 | hb5ebaad_0 443.1 MB conda-forge 2025-05-07T19:46:08.4139212Z nspr-4.36 | h5888daf_0 225 KB conda-forge 2025-05-07T19:46:08.4139598Z nss-3.111 | h159eef7_0 1.9 MB conda-forge 2025-05-07T19:46:08.4139987Z ocl-icd-2.3.3 | hb9d3cd8_0 104 KB conda-forge 2025-05-07T19:46:08.4140565Z opencl-headers-2024.10.24 | h5888daf_0 53 KB conda-forge 2025-05-07T19:46:08.4141218Z rdma-core-57.0 | h5888daf_0 1.2 MB conda-forge 2025-05-07T19:46:08.4141671Z wayland-1.23.1 | h3e06ad9_0 314 KB conda-forge 2025-05-07T19:46:08.4142107Z xcb-util-0.4.1 | hb711507_2 19 KB conda-forge 2025-05-07T19:46:08.4142580Z xcb-util-cursor-0.1.5 | hb9d3cd8_0 20 KB conda-forge 2025-05-07T19:46:08.4143073Z xcb-util-image-0.4.0 | hb711507_2 24 KB conda-forge 2025-05-07T19:46:08.4143552Z xcb-util-keysyms-0.4.1 | hb711507_0 14 KB conda-forge 2025-05-07T19:46:08.4144073Z xcb-util-renderutil-0.3.10 | hb711507_0 17 KB conda-forge 2025-05-07T19:46:08.4144559Z xcb-util-wm-0.4.2 | hb711507_0 50 KB conda-forge 2025-05-07T19:46:08.4145051Z xkeyboard-config-2.44 | hb9d3cd8_0 384 KB conda-forge 2025-05-07T19:46:08.4145561Z xorg-libxcomposite-0.4.6 | hb9d3cd8_2 13 KB conda-forge 2025-05-07T19:46:08.4146084Z xorg-libxdamage-1.1.6 | hb9d3cd8_0 13 KB conda-forge 2025-05-07T19:46:08.4146548Z ------------------------------------------------------------ 2025-05-07T19:46:08.4146909Z Total: 1.59 GB 2025-05-07T19:46:08.4147152Z 2025-05-07T19:46:08.4147285Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:08.4147517Z 2025-05-07T19:46:08.4147703Z attr conda-forge/linux-64::attr-2.5.1-h166bdaf_1 2025-05-07T19:46:08.4148155Z binutils conda-forge/linux-64::binutils-2.40-h4852527_7 2025-05-07T19:46:08.4148652Z c-compiler 
conda-forge/linux-64::c-compiler-1.5.2-h0b41bf4_0 2025-05-07T19:46:08.4149103Z cuda conda-forge/noarch::cuda-12.6.3-ha804496_0 2025-05-07T19:46:08.4149624Z cuda-cccl_linux-64 conda-forge/noarch::cuda-cccl_linux-64-12.6.77-ha770c72_0 2025-05-07T19:46:08.4150255Z cuda-command-line~ conda-forge/linux-64::cuda-command-line-tools-12.6.3-ha770c72_0 2025-05-07T19:46:08.4150888Z cuda-compiler conda-forge/noarch::cuda-compiler-12.6.3-hbad6d8a_0 2025-05-07T19:46:08.4151490Z cuda-crt-dev_linu~ conda-forge/noarch::cuda-crt-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:08.4152074Z cuda-crt-tools conda-forge/linux-64::cuda-crt-tools-12.6.85-ha770c72_0 2025-05-07T19:46:08.4152638Z cuda-cudart conda-forge/linux-64::cuda-cudart-12.6.77-h5888daf_0 2025-05-07T19:46:08.4153284Z cuda-cudart-dev conda-forge/linux-64::cuda-cudart-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.4153871Z cuda-cudart-dev_l~ conda-forge/noarch::cuda-cudart-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.4154481Z cuda-cudart-static conda-forge/linux-64::cuda-cudart-static-12.6.77-h5888daf_0 2025-05-07T19:46:08.4155166Z cuda-cudart-stati~ conda-forge/noarch::cuda-cudart-static_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.4155786Z cuda-cudart_linux~ conda-forge/noarch::cuda-cudart_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.4156337Z cuda-cuobjdump conda-forge/linux-64::cuda-cuobjdump-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.4156860Z cuda-cupti conda-forge/linux-64::cuda-cupti-12.6.80-hbd13f7d_0 2025-05-07T19:46:08.4157469Z cuda-cupti-dev conda-forge/linux-64::cuda-cupti-dev-12.6.80-h5888daf_0 2025-05-07T19:46:08.4157988Z cuda-cuxxfilt conda-forge/linux-64::cuda-cuxxfilt-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.4158533Z cuda-driver-dev conda-forge/linux-64::cuda-driver-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.4159105Z cuda-driver-dev_l~ conda-forge/noarch::cuda-driver-dev_linux-64-12.6.77-h3f2d84a_0 2025-05-07T19:46:08.4159645Z cuda-gdb conda-forge/linux-64::cuda-gdb-12.6.77-h50b4baa_1 
2025-05-07T19:46:08.4160134Z cuda-libraries conda-forge/linux-64::cuda-libraries-12.6.3-ha770c72_0 2025-05-07T19:46:08.4160683Z cuda-libraries-dev conda-forge/linux-64::cuda-libraries-dev-12.6.3-ha770c72_0 2025-05-07T19:46:08.4161231Z cuda-nsight conda-forge/linux-64::cuda-nsight-12.6.77-h7938cbb_0 2025-05-07T19:46:08.4161693Z cuda-nvcc conda-forge/linux-64::cuda-nvcc-12.6.85-hcdd1206_0 2025-05-07T19:46:08.4162218Z cuda-nvcc-dev_lin~ conda-forge/noarch::cuda-nvcc-dev_linux-64-12.6.85-he91c749_0 2025-05-07T19:46:08.4163307Z cuda-nvcc-impl conda-forge/linux-64::cuda-nvcc-impl-12.6.85-h85509e4_0 2025-05-07T19:46:08.4163873Z cuda-nvcc-tools conda-forge/linux-64::cuda-nvcc-tools-12.6.85-he02047a_0 2025-05-07T19:46:08.4164487Z cuda-nvcc_linux-64 conda-forge/linux-64::cuda-nvcc_linux-64-12.6.85-h04802cd_0 2025-05-07T19:46:08.4165153Z cuda-nvdisasm conda-forge/linux-64::cuda-nvdisasm-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.4165716Z cuda-nvml-dev conda-forge/linux-64::cuda-nvml-dev-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.4166266Z cuda-nvprof conda-forge/linux-64::cuda-nvprof-12.6.80-hbd13f7d_0 2025-05-07T19:46:08.4166792Z cuda-nvprune conda-forge/linux-64::cuda-nvprune-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.4167326Z cuda-nvrtc conda-forge/linux-64::cuda-nvrtc-12.6.85-hbd13f7d_0 2025-05-07T19:46:08.4167858Z cuda-nvrtc-dev conda-forge/linux-64::cuda-nvrtc-dev-12.6.85-h5888daf_0 2025-05-07T19:46:08.4168398Z cuda-nvtx conda-forge/linux-64::cuda-nvtx-12.6.77-hbd13f7d_0 2025-05-07T19:46:08.4168961Z cuda-nvvm-dev_lin~ conda-forge/noarch::cuda-nvvm-dev_linux-64-12.6.85-ha770c72_0 2025-05-07T19:46:08.4169550Z cuda-nvvm-impl conda-forge/linux-64::cuda-nvvm-impl-12.6.85-he02047a_0 2025-05-07T19:46:08.4170132Z cuda-nvvm-tools conda-forge/linux-64::cuda-nvvm-tools-12.6.85-he02047a_0 2025-05-07T19:46:08.4170663Z cuda-nvvp conda-forge/linux-64::cuda-nvvp-12.6.80-hbd13f7d_1 2025-05-07T19:46:08.4171179Z cuda-opencl conda-forge/linux-64::cuda-opencl-12.6.77-hbd13f7d_0 
2025-05-07T19:46:08.4171743Z cuda-opencl-dev conda-forge/linux-64::cuda-opencl-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.4172341Z cuda-profiler-api conda-forge/linux-64::cuda-profiler-api-12.6.77-h7938cbb_0 2025-05-07T19:46:08.4172925Z cuda-runtime conda-forge/noarch::cuda-runtime-12.6.3-ha804496_0 2025-05-07T19:46:08.4173502Z cuda-sanitizer-api conda-forge/linux-64::cuda-sanitizer-api-12.6.77-hbd13f7d_1 2025-05-07T19:46:08.4174095Z cuda-toolkit conda-forge/noarch::cuda-toolkit-12.6.3-ha804496_0 2025-05-07T19:46:08.4174613Z cuda-tools conda-forge/linux-64::cuda-tools-12.6.3-ha770c72_0 2025-05-07T19:46:08.4175106Z cuda-version conda-forge/noarch::cuda-version-12.6-h7480c83_3 2025-05-07T19:46:08.4175679Z cuda-visual-tools conda-forge/linux-64::cuda-visual-tools-12.6.3-ha770c72_0 2025-05-07T19:46:08.4176599Z cxx-compiler conda-forge/linux-64::cxx-compiler-1.5.2-hf52228f_0 2025-05-07T19:46:08.4177094Z dbus conda-forge/linux-64::dbus-1.13.6-h5008d03_3 2025-05-07T19:46:08.4177722Z expat conda-forge/linux-64::expat-2.7.0-h5888daf_0 2025-05-07T19:46:08.4178138Z gcc conda-forge/linux-64::gcc-11.4.0-h602e360_13 2025-05-07T19:46:08.4178595Z gds-tools conda-forge/linux-64::gds-tools-1.11.1.6-h5888daf_4 2025-05-07T19:46:08.4179035Z gmp conda-forge/linux-64::gmp-6.3.0-hac33072_2 2025-05-07T19:46:08.4179558Z gxx conda-forge/linux-64::gxx-11.4.0-h602e360_13 2025-05-07T19:46:08.4179993Z libcap conda-forge/linux-64::libcap-2.75-h39aace5_0 2025-05-07T19:46:08.4180573Z libcublas conda-forge/linux-64::libcublas-12.6.4.1-h5888daf_1 2025-05-07T19:46:08.4181120Z libcublas-dev conda-forge/linux-64::libcublas-dev-12.6.4.1-h5888daf_1 2025-05-07T19:46:08.4181641Z libcufft conda-forge/linux-64::libcufft-11.3.0.4-hbd13f7d_0 2025-05-07T19:46:08.4182165Z libcufft-dev conda-forge/linux-64::libcufft-dev-11.3.0.4-h5888daf_0 2025-05-07T19:46:08.4182705Z libcufile conda-forge/linux-64::libcufile-1.11.1.6-h12f29b5_4 2025-05-07T19:46:08.4183230Z libcufile-dev 
conda-forge/linux-64::libcufile-dev-1.11.1.6-h5888daf_4 2025-05-07T19:46:08.4183777Z libcurand conda-forge/linux-64::libcurand-10.3.7.77-hbd13f7d_0 2025-05-07T19:46:08.4184313Z libcurand-dev conda-forge/linux-64::libcurand-dev-10.3.7.77-h5888daf_0 2025-05-07T19:46:08.4184878Z libcusolver conda-forge/linux-64::libcusolver-11.7.1.2-h5888daf_1 2025-05-07T19:46:08.4185454Z libcusolver-dev conda-forge/linux-64::libcusolver-dev-11.7.1.2-h5888daf_1 2025-05-07T19:46:08.4186015Z libcusparse conda-forge/linux-64::libcusparse-12.5.4.2-hbd13f7d_0 2025-05-07T19:46:08.4186594Z libcusparse-dev conda-forge/linux-64::libcusparse-dev-12.5.4.2-h5888daf_0 2025-05-07T19:46:08.4187161Z libgcrypt-lib conda-forge/linux-64::libgcrypt-lib-1.11.0-hb9d3cd8_2 2025-05-07T19:46:08.4187701Z libgpg-error conda-forge/linux-64::libgpg-error-1.55-h3f2d84a_0 2025-05-07T19:46:08.4188193Z libnl conda-forge/linux-64::libnl-3.11.0-hb9d3cd8_0 2025-05-07T19:46:08.4188633Z libnpp conda-forge/linux-64::libnpp-12.3.1.54-h5888daf_0 2025-05-07T19:46:08.4189128Z libnpp-dev conda-forge/linux-64::libnpp-dev-12.3.1.54-h5888daf_0 2025-05-07T19:46:08.4189619Z libnuma conda-forge/linux-64::libnuma-2.0.18-h4ab18f5_2 2025-05-07T19:46:08.4190123Z libnvfatbin conda-forge/linux-64::libnvfatbin-12.6.77-hbd13f7d_0 2025-05-07T19:46:08.4190688Z libnvfatbin-dev conda-forge/linux-64::libnvfatbin-dev-12.6.77-h5888daf_0 2025-05-07T19:46:08.4191251Z libnvjitlink conda-forge/linux-64::libnvjitlink-12.6.85-hbd13f7d_0 2025-05-07T19:46:08.4191837Z libnvjitlink-dev conda-forge/linux-64::libnvjitlink-dev-12.6.85-h5888daf_0 2025-05-07T19:46:08.4192392Z libnvjpeg conda-forge/linux-64::libnvjpeg-12.3.3.54-h5888daf_0 2025-05-07T19:46:08.4192941Z libnvjpeg-dev conda-forge/linux-64::libnvjpeg-dev-12.3.3.54-ha770c72_0 2025-05-07T19:46:08.4193504Z libsystemd0 conda-forge/linux-64::libsystemd0-257.4-h4e0b6ca_1 2025-05-07T19:46:08.4193993Z libudev1 conda-forge/linux-64::libudev1-257.4-hbe16f8c_1 2025-05-07T19:46:08.4194515Z libxkbcommon 
conda-forge/linux-64::libxkbcommon-1.7.0-h2c5496b_1 2025-05-07T19:46:08.4195038Z libxkbfile conda-forge/linux-64::libxkbfile-1.1.0-h166bdaf_1 2025-05-07T19:46:08.4195506Z lz4-c conda-forge/linux-64::lz4-c-1.10.0-h5888daf_1 2025-05-07T19:46:08.4196025Z nsight-compute conda-forge/linux-64::nsight-compute-2024.3.2.3-hb5ebaad_0 2025-05-07T19:46:08.4196528Z nspr conda-forge/linux-64::nspr-4.36-h5888daf_0 2025-05-07T19:46:08.4196935Z nss conda-forge/linux-64::nss-3.111-h159eef7_0 2025-05-07T19:46:08.4197348Z ocl-icd conda-forge/linux-64::ocl-icd-2.3.3-hb9d3cd8_0 2025-05-07T19:46:08.4197888Z opencl-headers conda-forge/linux-64::opencl-headers-2024.10.24-h5888daf_0 2025-05-07T19:46:08.4198504Z rdma-core conda-forge/linux-64::rdma-core-57.0-h5888daf_0 2025-05-07T19:46:08.4198991Z wayland conda-forge/linux-64::wayland-1.23.1-h3e06ad9_0 2025-05-07T19:46:08.4199468Z xcb-util conda-forge/linux-64::xcb-util-0.4.1-hb711507_2 2025-05-07T19:46:08.4200050Z xcb-util-cursor conda-forge/linux-64::xcb-util-cursor-0.1.5-hb9d3cd8_0 2025-05-07T19:46:08.4200626Z xcb-util-image conda-forge/linux-64::xcb-util-image-0.4.0-hb711507_2 2025-05-07T19:46:08.4201192Z xcb-util-keysyms conda-forge/linux-64::xcb-util-keysyms-0.4.1-hb711507_0 2025-05-07T19:46:08.4201817Z xcb-util-renderut~ conda-forge/linux-64::xcb-util-renderutil-0.3.10-hb711507_0 2025-05-07T19:46:08.4202402Z xcb-util-wm conda-forge/linux-64::xcb-util-wm-0.4.2-hb711507_0 2025-05-07T19:46:08.4202947Z xkeyboard-config conda-forge/linux-64::xkeyboard-config-2.44-hb9d3cd8_0 2025-05-07T19:46:08.4203553Z xorg-libxcomposite conda-forge/linux-64::xorg-libxcomposite-0.4.6-hb9d3cd8_2 2025-05-07T19:46:08.4204170Z xorg-libxdamage conda-forge/linux-64::xorg-libxdamage-1.1.6-hb9d3cd8_0 2025-05-07T19:46:08.4204509Z 2025-05-07T19:46:08.4204562Z 2025-05-07T19:46:08.4204567Z 2025-05-07T19:46:08.4204712Z Downloading and Extracting Packages: ...working... 2025-05-07T19:46:08.4205100Z nsight-compute-2024. 
| 443.1 MB | | 0% 2025-05-07T19:46:08.4205364Z 2025-05-07T19:46:08.4205685Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:08.4205937Z 2025-05-07T19:46:08.4205941Z 2025-05-07T19:46:08.4206171Z libcufft-11.3.0.4 | 156.2 MB | | 0%  2025-05-07T19:46:08.4206425Z 2025-05-07T19:46:08.4206429Z 2025-05-07T19:46:08.4206433Z 2025-05-07T19:46:08.4206670Z libcusparse-12.5.4.2 | 118.6 MB | | 0%  2025-05-07T19:46:08.4206960Z 2025-05-07T19:46:08.4206963Z 2025-05-07T19:46:08.4206967Z 2025-05-07T19:46:08.4206970Z 2025-05-07T19:46:08.4207303Z cuda-nsight-12.6.77 | 113.2 MB | | 0%  2025-05-07T19:46:08.4207580Z 2025-05-07T19:46:08.4207584Z 2025-05-07T19:46:08.4207600Z 2025-05-07T19:46:08.4207604Z 2025-05-07T19:46:08.4207616Z 2025-05-07T19:46:08.4208908Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:08.4209197Z 2025-05-07T19:46:08.4209201Z 2025-05-07T19:46:08.4209205Z 2025-05-07T19:46:08.4209208Z 2025-05-07T19:46:08.4209227Z 2025-05-07T19:46:08.4209231Z 2025-05-07T19:46:08.4210214Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:08.4210565Z 2025-05-07T19:46:08.4210569Z 2025-05-07T19:46:08.4210574Z 2025-05-07T19:46:08.4210578Z 2025-05-07T19:46:08.4210582Z 2025-05-07T19:46:08.4210587Z 2025-05-07T19:46:08.4210625Z 2025-05-07T19:46:08.4210883Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:08.4211165Z 2025-05-07T19:46:08.4211170Z 2025-05-07T19:46:08.4211175Z 2025-05-07T19:46:08.4211205Z 2025-05-07T19:46:08.4211208Z 2025-05-07T19:46:08.4211212Z 2025-05-07T19:46:08.4211216Z 2025-05-07T19:46:08.4211248Z 2025-05-07T19:46:08.4212013Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:08.4212318Z 2025-05-07T19:46:08.4212322Z 2025-05-07T19:46:08.4212327Z 2025-05-07T19:46:08.4212346Z 2025-05-07T19:46:08.4212350Z 2025-05-07T19:46:08.4212354Z 2025-05-07T19:46:08.4212372Z 2025-05-07T19:46:08.4212389Z 2025-05-07T19:46:08.4212393Z 2025-05-07T19:46:08.4213051Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:08.4213360Z 2025-05-07T19:46:08.4213363Z 
2025-05-07T19:46:08.4213367Z 2025-05-07T19:46:08.4213370Z 2025-05-07T19:46:08.4213374Z 2025-05-07T19:46:08.4213391Z 2025-05-07T19:46:08.4213394Z 2025-05-07T19:46:08.4213397Z 2025-05-07T19:46:08.4213401Z 2025-05-07T19:46:08.4213405Z 2025-05-07T19:46:08.4214162Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:08.4214643Z 2025-05-07T19:46:08.4214646Z 2025-05-07T19:46:08.4214650Z 2025-05-07T19:46:08.4214654Z 2025-05-07T19:46:08.4214672Z 2025-05-07T19:46:08.4214675Z 2025-05-07T19:46:08.4214691Z 2025-05-07T19:46:08.4214694Z 2025-05-07T19:46:08.4214698Z 2025-05-07T19:46:08.4214701Z 2025-05-07T19:46:08.4214705Z 2025-05-07T19:46:08.4215354Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:08.4215667Z 2025-05-07T19:46:08.4215684Z 2025-05-07T19:46:08.4215702Z 2025-05-07T19:46:08.4215706Z 2025-05-07T19:46:08.4215709Z 2025-05-07T19:46:08.4215713Z 2025-05-07T19:46:08.4215716Z 2025-05-07T19:46:08.4215719Z 2025-05-07T19:46:08.4215723Z 2025-05-07T19:46:08.4215726Z 2025-05-07T19:46:08.4215730Z 2025-05-07T19:46:08.4215733Z 2025-05-07T19:46:08.4216269Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:08.4216589Z 2025-05-07T19:46:08.4216593Z 2025-05-07T19:46:08.4216596Z 2025-05-07T19:46:08.4216607Z 2025-05-07T19:46:08.4216610Z 2025-05-07T19:46:08.4216627Z 2025-05-07T19:46:08.4216631Z 2025-05-07T19:46:08.4216635Z 2025-05-07T19:46:08.4216638Z 2025-05-07T19:46:08.4216642Z 2025-05-07T19:46:08.4216645Z 2025-05-07T19:46:08.4216649Z 2025-05-07T19:46:08.4216652Z 2025-05-07T19:46:08.4220981Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:08.4221325Z 2025-05-07T19:46:08.4221330Z 2025-05-07T19:46:08.4221333Z 2025-05-07T19:46:08.4221337Z 2025-05-07T19:46:08.4221341Z 2025-05-07T19:46:08.4221344Z 2025-05-07T19:46:08.4221348Z 2025-05-07T19:46:08.4221352Z 2025-05-07T19:46:08.4221355Z 2025-05-07T19:46:08.4221359Z 2025-05-07T19:46:08.4221362Z 2025-05-07T19:46:08.4221366Z 2025-05-07T19:46:08.4221369Z 2025-05-07T19:46:08.4221388Z 2025-05-07T19:46:08.4228540Z cuda-nvcc-dev_linux- 
| 10.8 MB | | 0%  2025-05-07T19:46:08.4229632Z 2025-05-07T19:46:08.4229636Z 2025-05-07T19:46:08.4229646Z 2025-05-07T19:46:08.4229649Z 2025-05-07T19:46:08.4229665Z 2025-05-07T19:46:08.4229669Z 2025-05-07T19:46:08.4229672Z 2025-05-07T19:46:08.4229675Z 2025-05-07T19:46:08.4229679Z 2025-05-07T19:46:08.4229682Z 2025-05-07T19:46:08.4229686Z 2025-05-07T19:46:08.4229689Z 2025-05-07T19:46:08.4229692Z 2025-05-07T19:46:08.4229696Z 2025-05-07T19:46:08.4229699Z 2025-05-07T19:46:08.4230011Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:08.4230359Z 2025-05-07T19:46:08.4230362Z 2025-05-07T19:46:08.4230366Z 2025-05-07T19:46:08.4230369Z 2025-05-07T19:46:08.4230373Z 2025-05-07T19:46:08.4230376Z 2025-05-07T19:46:08.4230379Z 2025-05-07T19:46:08.4230383Z 2025-05-07T19:46:08.4230386Z 2025-05-07T19:46:08.4230390Z 2025-05-07T19:46:08.4230393Z 2025-05-07T19:46:08.4230396Z 2025-05-07T19:46:08.4230400Z 2025-05-07T19:46:08.4230404Z 2025-05-07T19:46:08.4230407Z 2025-05-07T19:46:08.4230410Z 2025-05-07T19:46:08.4230745Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:08.4231088Z 2025-05-07T19:46:08.4231092Z 2025-05-07T19:46:08.4231095Z 2025-05-07T19:46:08.4231099Z 2025-05-07T19:46:08.4231102Z 2025-05-07T19:46:08.4231106Z 2025-05-07T19:46:08.4231110Z 2025-05-07T19:46:08.4231113Z 2025-05-07T19:46:08.4231117Z 2025-05-07T19:46:08.4231120Z 2025-05-07T19:46:08.4231124Z 2025-05-07T19:46:08.4231131Z 2025-05-07T19:46:08.4231135Z 2025-05-07T19:46:08.4231138Z 2025-05-07T19:46:08.4231141Z 2025-05-07T19:46:08.4231145Z 2025-05-07T19:46:08.4231161Z 2025-05-07T19:46:08.4231508Z cuda-nvvm-impl-12.6. 
| 7.7 MB | | 0%  2025-05-07T19:46:08.4231835Z 2025-05-07T19:46:08.4231839Z 2025-05-07T19:46:08.4231843Z 2025-05-07T19:46:08.4231846Z 2025-05-07T19:46:08.4231849Z 2025-05-07T19:46:08.4231853Z 2025-05-07T19:46:08.4231869Z 2025-05-07T19:46:08.4231872Z 2025-05-07T19:46:08.4231876Z 2025-05-07T19:46:08.4232026Z 2025-05-07T19:46:08.4232030Z 2025-05-07T19:46:08.4232033Z 2025-05-07T19:46:08.4232037Z 2025-05-07T19:46:08.4232040Z 2025-05-07T19:46:08.4232044Z 2025-05-07T19:46:08.4232047Z 2025-05-07T19:46:08.4232050Z 2025-05-07T19:46:08.4232054Z 2025-05-07T19:46:08.4232388Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:08.4232729Z 2025-05-07T19:46:08.4232802Z 2025-05-07T19:46:08.4232806Z 2025-05-07T19:46:08.4232810Z 2025-05-07T19:46:08.4232813Z 2025-05-07T19:46:08.4232817Z 2025-05-07T19:46:08.4232820Z 2025-05-07T19:46:08.4232824Z 2025-05-07T19:46:08.4232828Z 2025-05-07T19:46:08.4232831Z 2025-05-07T19:46:08.4232835Z 2025-05-07T19:46:08.4232838Z 2025-05-07T19:46:08.4232841Z 2025-05-07T19:46:08.4232844Z 2025-05-07T19:46:08.4232848Z 2025-05-07T19:46:08.4232851Z 2025-05-07T19:46:08.4232855Z 2025-05-07T19:46:08.4232858Z 2025-05-07T19:46:08.4232862Z 2025-05-07T19:46:08.5167402Z ... (more hidden) ... 2025-05-07T19:46:08.5167908Z nsight-compute-2024. | 443.1 MB | | 0% 2025-05-07T19:46:08.5168175Z 2025-05-07T19:46:08.5175861Z libcublas-12.6.4.1 | 256.2 MB | | 0%  2025-05-07T19:46:08.5177156Z 2025-05-07T19:46:08.5177169Z 2025-05-07T19:46:08.5188091Z libcufft-11.3.0.4 | 156.2 MB | 1 | 1%  2025-05-07T19:46:08.5188869Z 2025-05-07T19:46:08.5188907Z 2025-05-07T19:46:08.5188917Z 2025-05-07T19:46:08.5197987Z libcusparse-12.5.4.2 | 118.6 MB | 1 | 1%  2025-05-07T19:46:08.5198799Z 2025-05-07T19:46:08.5198811Z 2025-05-07T19:46:08.5198821Z 2025-05-07T19:46:08.5198832Z 2025-05-07T19:46:08.6168611Z cuda-nsight-12.6.77 | 113.2 MB | 6 | 6%  2025-05-07T19:46:08.6169934Z nsight-compute-2024. 
| 443.1 MB | 1 | 2% 2025-05-07T19:46:08.6170678Z 2025-05-07T19:46:08.6180587Z libcublas-12.6.4.1 | 256.2 MB | 3 | 3%  2025-05-07T19:46:08.6180880Z 2025-05-07T19:46:08.6180910Z 2025-05-07T19:46:08.6188936Z libcufft-11.3.0.4 | 156.2 MB | 5 | 6%  2025-05-07T19:46:08.6189724Z 2025-05-07T19:46:08.6189736Z 2025-05-07T19:46:08.6189747Z 2025-05-07T19:46:08.6199435Z libcusparse-12.5.4.2 | 118.6 MB | 7 | 8%  2025-05-07T19:46:08.6199712Z 2025-05-07T19:46:08.6199734Z 2025-05-07T19:46:08.6199738Z 2025-05-07T19:46:08.6199742Z 2025-05-07T19:46:08.7168842Z cuda-nsight-12.6.77 | 113.2 MB | #2 | 13%  2025-05-07T19:46:08.7171745Z nsight-compute-2024. | 443.1 MB | 2 | 3% 2025-05-07T19:46:08.7172006Z 2025-05-07T19:46:08.7183121Z libcublas-12.6.4.1 | 256.2 MB | 5 | 6%  2025-05-07T19:46:08.7183930Z 2025-05-07T19:46:08.7183954Z 2025-05-07T19:46:08.7191861Z libcufft-11.3.0.4 | 156.2 MB | 8 | 9%  2025-05-07T19:46:08.7192121Z 2025-05-07T19:46:08.7192133Z 2025-05-07T19:46:08.7192137Z 2025-05-07T19:46:08.7371698Z libcusparse-12.5.4.2 | 118.6 MB | #3 | 13%  2025-05-07T19:46:08.7372614Z 2025-05-07T19:46:08.7372629Z 2025-05-07T19:46:08.7372655Z 2025-05-07T19:46:08.7372666Z 2025-05-07T19:46:08.8170316Z cuda-nsight-12.6.77 | 113.2 MB | #9 | 19%  2025-05-07T19:46:08.8172452Z nsight-compute-2024. | 443.1 MB | 4 | 4% 2025-05-07T19:46:08.8174096Z 2025-05-07T19:46:08.8189423Z libcublas-12.6.4.1 | 256.2 MB | 8 | 8%  2025-05-07T19:46:08.8189690Z 2025-05-07T19:46:08.8189705Z 2025-05-07T19:46:08.8189709Z 2025-05-07T19:46:08.8203514Z libcusparse-12.5.4.2 | 118.6 MB | #8 | 19%  2025-05-07T19:46:08.8203835Z 2025-05-07T19:46:08.8203840Z 2025-05-07T19:46:08.8445811Z libcufft-11.3.0.4 | 156.2 MB | #2 | 12%  2025-05-07T19:46:08.8446115Z 2025-05-07T19:46:08.8446120Z 2025-05-07T19:46:08.8446123Z 2025-05-07T19:46:08.8446127Z 2025-05-07T19:46:08.9172140Z cuda-nsight-12.6.77 | 113.2 MB | ##5 | 25%  2025-05-07T19:46:08.9178714Z nsight-compute-2024. 
| 443.1 MB | 5 | 6% 2025-05-07T19:46:08.9181445Z 2025-05-07T19:46:08.9191298Z libcublas-12.6.4.1 | 256.2 MB | # | 11%  2025-05-07T19:46:08.9191578Z 2025-05-07T19:46:08.9191583Z 2025-05-07T19:46:08.9191586Z 2025-05-07T19:46:08.9221018Z libcusparse-12.5.4.2 | 118.6 MB | ##3 | 24%  2025-05-07T19:46:08.9221347Z 2025-05-07T19:46:08.9221352Z 2025-05-07T19:46:08.9570396Z libcufft-11.3.0.4 | 156.2 MB | #5 | 15%  2025-05-07T19:46:08.9570709Z 2025-05-07T19:46:08.9570714Z 2025-05-07T19:46:08.9570718Z 2025-05-07T19:46:08.9570722Z 2025-05-07T19:46:09.0183109Z cuda-nsight-12.6.77 | 113.2 MB | ### | 31%  2025-05-07T19:46:09.0183431Z 2025-05-07T19:46:09.0193753Z libcublas-12.6.4.1 | 256.2 MB | #3 | 13%  2025-05-07T19:46:09.0194508Z 2025-05-07T19:46:09.0194520Z 2025-05-07T19:46:09.0195495Z 2025-05-07T19:46:09.0221960Z libcusparse-12.5.4.2 | 118.6 MB | ##9 | 29%  2025-05-07T19:46:09.0222861Z 2025-05-07T19:46:09.0224007Z 2025-05-07T19:46:09.0319695Z libcufft-11.3.0.4 | 156.2 MB | #8 | 18%  2025-05-07T19:46:09.0582558Z nsight-compute-2024. | 443.1 MB | 6 | 7% 2025-05-07T19:46:09.0583339Z 2025-05-07T19:46:09.0583365Z 2025-05-07T19:46:09.0583376Z 2025-05-07T19:46:09.0583401Z 2025-05-07T19:46:09.1182987Z cuda-nsight-12.6.77 | 113.2 MB | ###6 | 37%  2025-05-07T19:46:09.1183319Z 2025-05-07T19:46:09.1197798Z libcublas-12.6.4.1 | 256.2 MB | #5 | 16%  2025-05-07T19:46:09.1198092Z 2025-05-07T19:46:09.1198227Z 2025-05-07T19:46:09.1198268Z 2025-05-07T19:46:09.1223426Z libcusparse-12.5.4.2 | 118.6 MB | ###5 | 35%  2025-05-07T19:46:09.1224307Z 2025-05-07T19:46:09.1224337Z 2025-05-07T19:46:09.1359044Z libcufft-11.3.0.4 | 156.2 MB | ##1 | 21%  2025-05-07T19:46:09.1582136Z nsight-compute-2024. 
| 443.1 MB | 8 | 8% 2025-05-07T19:46:09.1582434Z 2025-05-07T19:46:09.1582439Z 2025-05-07T19:46:09.1582462Z 2025-05-07T19:46:09.1582466Z 2025-05-07T19:46:09.2200978Z cuda-nsight-12.6.77 | 113.2 MB | ####2 | 42%  2025-05-07T19:46:09.2201856Z 2025-05-07T19:46:09.2201871Z 2025-05-07T19:46:09.2201882Z 2025-05-07T19:46:09.2224677Z libcusparse-12.5.4.2 | 118.6 MB | #### | 40%  2025-05-07T19:46:09.2225558Z 2025-05-07T19:46:09.2225641Z 2025-05-07T19:46:09.2229516Z libcufft-11.3.0.4 | 156.2 MB | ##4 | 25%  2025-05-07T19:46:09.2230493Z 2025-05-07T19:46:09.2362144Z libcublas-12.6.4.1 | 256.2 MB | #8 | 18%  2025-05-07T19:46:09.2615188Z nsight-compute-2024. | 443.1 MB | 9 | 9% 2025-05-07T19:46:09.2615992Z 2025-05-07T19:46:09.2616021Z 2025-05-07T19:46:09.2616033Z 2025-05-07T19:46:09.2616044Z 2025-05-07T19:46:09.3200564Z cuda-nsight-12.6.77 | 113.2 MB | ####7 | 48%  2025-05-07T19:46:09.3200881Z 2025-05-07T19:46:09.3201011Z 2025-05-07T19:46:09.3201019Z 2025-05-07T19:46:09.3224597Z libcusparse-12.5.4.2 | 118.6 MB | ####5 | 46%  2025-05-07T19:46:09.3224945Z 2025-05-07T19:46:09.3224949Z 2025-05-07T19:46:09.3304185Z libcufft-11.3.0.4 | 156.2 MB | ##7 | 28%  2025-05-07T19:46:09.3305040Z 2025-05-07T19:46:09.3362896Z libcublas-12.6.4.1 | 256.2 MB | ## | 20%  2025-05-07T19:46:09.3703314Z nsight-compute-2024. | 443.1 MB | # | 11% 2025-05-07T19:46:09.3704096Z 2025-05-07T19:46:09.3704142Z 2025-05-07T19:46:09.3704154Z 2025-05-07T19:46:09.3704164Z 2025-05-07T19:46:09.4212528Z cuda-nsight-12.6.77 | 113.2 MB | #####3 | 53%  2025-05-07T19:46:09.4212845Z 2025-05-07T19:46:09.4212849Z 2025-05-07T19:46:09.4212854Z 2025-05-07T19:46:09.4231679Z libcusparse-12.5.4.2 | 118.6 MB | #####1 | 51%  2025-05-07T19:46:09.4232551Z 2025-05-07T19:46:09.4232580Z 2025-05-07T19:46:09.4365101Z libcufft-11.3.0.4 | 156.2 MB | ### | 31%  2025-05-07T19:46:09.4408801Z nsight-compute-2024. 
| 443.1 MB | #2 | 12% 2025-05-07T19:46:09.4409377Z 2025-05-07T19:46:09.4703124Z libcublas-12.6.4.1 | 256.2 MB | ##2 | 23%  2025-05-07T19:46:09.4703424Z 2025-05-07T19:46:09.4703428Z 2025-05-07T19:46:09.4703433Z 2025-05-07T19:46:09.4703437Z 2025-05-07T19:46:09.5262287Z cuda-nsight-12.6.77 | 113.2 MB | #####8 | 59%  2025-05-07T19:46:09.5262602Z 2025-05-07T19:46:09.5262607Z 2025-05-07T19:46:09.5270072Z libcufft-11.3.0.4 | 156.2 MB | ###4 | 34%  2025-05-07T19:46:09.5270886Z 2025-05-07T19:46:09.5270898Z 2025-05-07T19:46:09.5270909Z 2025-05-07T19:46:09.5365159Z libcusparse-12.5.4.2 | 118.6 MB | #####6 | 57%  2025-05-07T19:46:09.5411131Z nsight-compute-2024. | 443.1 MB | #3 | 13% 2025-05-07T19:46:09.5411921Z 2025-05-07T19:46:09.5703163Z libcublas-12.6.4.1 | 256.2 MB | ##5 | 25%  2025-05-07T19:46:09.5703471Z 2025-05-07T19:46:09.5703476Z 2025-05-07T19:46:09.5703481Z 2025-05-07T19:46:09.5703484Z 2025-05-07T19:46:09.6264261Z cuda-nsight-12.6.77 | 113.2 MB | ######4 | 65%  2025-05-07T19:46:09.6265185Z 2025-05-07T19:46:09.6265199Z 2025-05-07T19:46:09.6293788Z libcufft-11.3.0.4 | 156.2 MB | ###7 | 37%  2025-05-07T19:46:09.6294625Z 2025-05-07T19:46:09.6294639Z 2025-05-07T19:46:09.6294650Z 2025-05-07T19:46:09.6415088Z libcusparse-12.5.4.2 | 118.6 MB | ######2 | 62%  2025-05-07T19:46:09.6698815Z 2025-05-07T19:46:09.6699269Z libcublas-12.6.4.1 | 256.2 MB | ##7 | 27%  2025-05-07T19:46:09.6704444Z nsight-compute-2024. 
| 443.1 MB | #4 | 15% 2025-05-07T19:46:09.6705198Z 2025-05-07T19:46:09.6705212Z 2025-05-07T19:46:09.6705223Z 2025-05-07T19:46:09.6705632Z 2025-05-07T19:46:09.7296803Z cuda-nsight-12.6.77 | 113.2 MB | #######1 | 72%  2025-05-07T19:46:09.7297291Z 2025-05-07T19:46:09.7297439Z 2025-05-07T19:46:09.7297450Z 2025-05-07T19:46:09.7336547Z libcusparse-12.5.4.2 | 118.6 MB | ######7 | 68%  2025-05-07T19:46:09.7336860Z 2025-05-07T19:46:09.7336884Z 2025-05-07T19:46:09.7415978Z libcufft-11.3.0.4 | 156.2 MB | #### | 40%  2025-05-07T19:46:09.7416837Z 2025-05-07T19:46:09.7699927Z libcublas-12.6.4.1 | 256.2 MB | ##9 | 30%  2025-05-07T19:46:09.7861611Z nsight-compute-2024. | 443.1 MB | #6 | 16% 2025-05-07T19:46:09.7861912Z 2025-05-07T19:46:09.7861916Z 2025-05-07T19:46:09.7861921Z 2025-05-07T19:46:09.7861924Z 2025-05-07T19:46:09.8298737Z cuda-nsight-12.6.77 | 113.2 MB | #######7 | 78%  2025-05-07T19:46:09.8299662Z 2025-05-07T19:46:09.8299677Z 2025-05-07T19:46:09.8299689Z 2025-05-07T19:46:09.8338527Z libcusparse-12.5.4.2 | 118.6 MB | #######3 | 73%  2025-05-07T19:46:09.8338888Z 2025-05-07T19:46:09.8339095Z 2025-05-07T19:46:09.8416971Z libcufft-11.3.0.4 | 156.2 MB | ####3 | 44%  2025-05-07T19:46:09.8417332Z 2025-05-07T19:46:09.8862548Z libcublas-12.6.4.1 | 256.2 MB | ###2 | 33%  2025-05-07T19:46:09.8863348Z 2025-05-07T19:46:09.8863412Z 2025-05-07T19:46:09.8863424Z 2025-05-07T19:46:09.8863456Z 2025-05-07T19:46:09.9297720Z cuda-nsight-12.6.77 | 113.2 MB | ########4 | 84%  2025-05-07T19:46:09.9298034Z 2025-05-07T19:46:09.9298039Z 2025-05-07T19:46:09.9298043Z 2025-05-07T19:46:09.9417099Z libcusparse-12.5.4.2 | 118.6 MB | #######9 | 79%  2025-05-07T19:46:09.9417426Z 2025-05-07T19:46:09.9422912Z libcublas-12.6.4.1 | 256.2 MB | ###5 | 35%  2025-05-07T19:46:09.9423179Z 2025-05-07T19:46:09.9423195Z 2025-05-07T19:46:09.9518547Z libcufft-11.3.0.4 | 156.2 MB | ####7 | 47%  2025-05-07T19:46:09.9862173Z nsight-compute-2024. 
| 443.1 MB | #7 | 17% 2025-05-07T19:46:09.9862463Z 2025-05-07T19:46:09.9862468Z 2025-05-07T19:46:09.9862472Z 2025-05-07T19:46:09.9862475Z 2025-05-07T19:46:10.0420856Z cuda-nsight-12.6.77 | 113.2 MB | ######### | 91%  2025-05-07T19:46:10.0421759Z 2025-05-07T19:46:10.0474581Z libcublas-12.6.4.1 | 256.2 MB | ###8 | 39%  2025-05-07T19:46:10.0475098Z 2025-05-07T19:46:10.0475103Z 2025-05-07T19:46:10.0475107Z 2025-05-07T19:46:10.0517024Z libcusparse-12.5.4.2 | 118.6 MB | ########4 | 85%  2025-05-07T19:46:10.0862635Z nsight-compute-2024. | 443.1 MB | #8 | 19% 2025-05-07T19:46:10.0863419Z 2025-05-07T19:46:10.0863450Z 2025-05-07T19:46:10.0863461Z 2025-05-07T19:46:10.0863473Z 2025-05-07T19:46:10.0936269Z cuda-nsight-12.6.77 | 113.2 MB | #########8 | 99%  2025-05-07T19:46:10.0936591Z 2025-05-07T19:46:10.0936596Z 2025-05-07T19:46:10.1422305Z libcufft-11.3.0.4 | 156.2 MB | ##### | 51%  2025-05-07T19:46:10.1423148Z 2025-05-07T19:46:10.1479007Z libcublas-12.6.4.1 | 256.2 MB | ####1 | 41%  2025-05-07T19:46:10.1479292Z 2025-05-07T19:46:10.1479297Z 2025-05-07T19:46:10.1479301Z 2025-05-07T19:46:10.1518214Z libcusparse-12.5.4.2 | 118.6 MB | ######### | 91%  2025-05-07T19:46:10.1938150Z nsight-compute-2024. | 443.1 MB | ## | 20% 2025-05-07T19:46:10.1938615Z 2025-05-07T19:46:10.1938653Z 2025-05-07T19:46:10.2421861Z libcufft-11.3.0.4 | 156.2 MB | #####4 | 55%  2025-05-07T19:46:10.2422160Z 2025-05-07T19:46:10.2477546Z libcublas-12.6.4.1 | 256.2 MB | ####4 | 44%  2025-05-07T19:46:10.2477842Z 2025-05-07T19:46:10.2477849Z 2025-05-07T19:46:10.2477854Z 2025-05-07T19:46:10.2517539Z libcusparse-12.5.4.2 | 118.6 MB | #########6 | 97%  2025-05-07T19:46:10.2940553Z nsight-compute-2024. | 443.1 MB | ##1 | 22% 2025-05-07T19:46:10.2940880Z 2025-05-07T19:46:10.2941185Z 2025-05-07T19:46:10.3429162Z libcufft-11.3.0.4 | 156.2 MB | #####8 | 58%  2025-05-07T19:46:10.3429440Z 2025-05-07T19:46:10.3518739Z libcublas-12.6.4.1 | 256.2 MB | ####7 | 47%  2025-05-07T19:46:10.3941834Z nsight-compute-2024. 
| 443.1 MB | ##3 | 24% 2025-05-07T19:46:10.3942130Z 2025-05-07T19:46:10.3942135Z 2025-05-07T19:46:10.4430355Z libcufft-11.3.0.4 | 156.2 MB | ######3 | 63%  2025-05-07T19:46:10.4430677Z 2025-05-07T19:46:10.4518549Z libcublas-12.6.4.1 | 256.2 MB | #####1 | 51%  2025-05-07T19:46:10.4942783Z nsight-compute-2024. | 443.1 MB | ##5 | 26% 2025-05-07T19:46:10.4943093Z 2025-05-07T19:46:10.4943098Z 2025-05-07T19:46:10.5520545Z libcufft-11.3.0.4 | 156.2 MB | ######7 | 67%  2025-05-07T19:46:10.5767195Z nsight-compute-2024. | 443.1 MB | ##8 | 29% 2025-05-07T19:46:10.5768046Z 2025-05-07T19:46:10.5942251Z libcublas-12.6.4.1 | 256.2 MB | #####4 | 54%  2025-05-07T19:46:10.5942561Z 2025-05-07T19:46:10.5942566Z 2025-05-07T19:46:10.6521093Z libcufft-11.3.0.4 | 156.2 MB | #######2 | 73%  2025-05-07T19:46:10.6767394Z nsight-compute-2024. | 443.1 MB | ###1 | 31% 2025-05-07T19:46:10.6767885Z 2025-05-07T19:46:10.7232386Z libcublas-12.6.4.1 | 256.2 MB | #####7 | 57%  2025-05-07T19:46:10.7232709Z 2025-05-07T19:46:10.7232715Z 2025-05-07T19:46:10.7574854Z libcufft-11.3.0.4 | 156.2 MB | #######6 | 77%  2025-05-07T19:46:10.7770049Z nsight-compute-2024. | 443.1 MB | ###3 | 33% 2025-05-07T19:46:10.7770514Z 2025-05-07T19:46:10.8234240Z libcublas-12.6.4.1 | 256.2 MB | ###### | 60%  2025-05-07T19:46:10.8234530Z 2025-05-07T19:46:10.8234573Z 2025-05-07T19:46:10.8770538Z libcufft-11.3.0.4 | 156.2 MB | ########1 | 82%  2025-05-07T19:46:10.8770850Z 2025-05-07T19:46:10.9139854Z libcublas-12.6.4.1 | 256.2 MB | ######4 | 64%  2025-05-07T19:46:10.9236043Z nsight-compute-2024. | 443.1 MB | ###5 | 35% 2025-05-07T19:46:10.9236370Z 2025-05-07T19:46:10.9236509Z 2025-05-07T19:46:10.9771286Z libcufft-11.3.0.4 | 156.2 MB | ########8 | 88%  2025-05-07T19:46:10.9771610Z 2025-05-07T19:46:11.0157406Z libcublas-12.6.4.1 | 256.2 MB | ######7 | 68%  2025-05-07T19:46:11.0243256Z nsight-compute-2024. 
| 443.1 MB | ###7 | 37% 2025-05-07T19:46:11.0243733Z 2025-05-07T19:46:11.0243772Z 2025-05-07T19:46:11.0785376Z libcufft-11.3.0.4 | 156.2 MB | #########3 | 93%  2025-05-07T19:46:11.0785945Z 2025-05-07T19:46:11.1244780Z libcublas-12.6.4.1 | 256.2 MB | ####### | 71%  2025-05-07T19:46:11.1245077Z 2025-05-07T19:46:11.1245139Z 2025-05-07T19:46:11.1287378Z libcufft-11.3.0.4 | 156.2 MB | #########8 | 98%  2025-05-07T19:46:11.1785008Z nsight-compute-2024. | 443.1 MB | ###9 | 39% 2025-05-07T19:46:11.1785314Z 2025-05-07T19:46:11.2289827Z libcublas-12.6.4.1 | 256.2 MB | #######4 | 74%  2025-05-07T19:46:11.2643194Z nsight-compute-2024. | 443.1 MB | ####1 | 42% 2025-05-07T19:46:11.2643689Z 2025-05-07T19:46:11.2643733Z 2025-05-07T19:46:11.2643739Z 2025-05-07T19:46:11.2643790Z 2025-05-07T19:46:11.2788680Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:11.2789012Z 2025-05-07T19:46:11.3067210Z libcublas-12.6.4.1 | 256.2 MB | #######8 | 78%  2025-05-07T19:46:11.3067505Z 2025-05-07T19:46:11.3067511Z 2025-05-07T19:46:11.3067529Z 2025-05-07T19:46:11.3067534Z 2025-05-07T19:46:11.3067562Z 2025-05-07T19:46:11.3291668Z cuda-nvvp-12.6.80 | 109.3 MB | | 0%  2025-05-07T19:46:11.4068787Z nsight-compute-2024. | 443.1 MB | ####3 | 44% 2025-05-07T19:46:11.4069093Z 2025-05-07T19:46:11.4069097Z 2025-05-07T19:46:11.4069102Z 2025-05-07T19:46:11.4069106Z 2025-05-07T19:46:11.4069111Z 2025-05-07T19:46:11.4359280Z cuda-nvvp-12.6.80 | 109.3 MB | 8 | 9%  2025-05-07T19:46:11.4742980Z nsight-compute-2024. 
| 443.1 MB | ####5 | 46% 2025-05-07T19:46:11.4743260Z 2025-05-07T19:46:11.4743265Z 2025-05-07T19:46:11.4743268Z 2025-05-07T19:46:11.5068280Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:11.5068600Z 2025-05-07T19:46:11.5068606Z 2025-05-07T19:46:11.5068610Z 2025-05-07T19:46:11.5068613Z 2025-05-07T19:46:11.5068617Z 2025-05-07T19:46:11.5190002Z cuda-nvvp-12.6.80 | 109.3 MB | #8 | 19%  2025-05-07T19:46:11.5190331Z 2025-05-07T19:46:11.5312235Z libcublas-12.6.4.1 | 256.2 MB | ########1 | 82%  2025-05-07T19:46:11.5312535Z 2025-05-07T19:46:11.5312539Z 2025-05-07T19:46:11.5312557Z 2025-05-07T19:46:11.5312561Z 2025-05-07T19:46:11.5312564Z 2025-05-07T19:46:11.5312568Z 2025-05-07T19:46:11.5586918Z libcusolver-11.7.1.2 | 95.8 MB | | 0%  2025-05-07T19:46:11.6300634Z nsight-compute-2024. | 443.1 MB | ####7 | 48% 2025-05-07T19:46:11.6300966Z 2025-05-07T19:46:11.6301094Z 2025-05-07T19:46:11.6301099Z 2025-05-07T19:46:11.6301257Z 2025-05-07T19:46:11.6301261Z 2025-05-07T19:46:11.6319382Z cuda-nvvp-12.6.80 | 109.3 MB | ##6 | 26%  2025-05-07T19:46:11.6319685Z 2025-05-07T19:46:11.6319705Z 2025-05-07T19:46:11.6319709Z 2025-05-07T19:46:11.6319713Z 2025-05-07T19:46:11.6319716Z 2025-05-07T19:46:11.6319720Z 2025-05-07T19:46:11.6666579Z libcusolver-11.7.1.2 | 95.8 MB | 5 | 6%  2025-05-07T19:46:11.6667102Z 2025-05-07T19:46:11.7020110Z libcublas-12.6.4.1 | 256.2 MB | ########4 | 84%  2025-05-07T19:46:11.7319885Z nsight-compute-2024. 
| 443.1 MB | ####9 | 49% 2025-05-07T19:46:11.7320380Z 2025-05-07T19:46:11.7320456Z 2025-05-07T19:46:11.7320461Z 2025-05-07T19:46:11.7320466Z 2025-05-07T19:46:11.7320469Z 2025-05-07T19:46:11.7320473Z 2025-05-07T19:46:11.7644699Z libcusolver-11.7.1.2 | 95.8 MB | #1 | 12%  2025-05-07T19:46:11.7645686Z 2025-05-07T19:46:11.7645700Z 2025-05-07T19:46:11.7645712Z 2025-05-07T19:46:11.7645723Z 2025-05-07T19:46:11.7645733Z 2025-05-07T19:46:11.8025170Z cuda-nvvp-12.6.80 | 109.3 MB | ###3 | 33%  2025-05-07T19:46:11.8026075Z 2025-05-07T19:46:11.8283835Z libcublas-12.6.4.1 | 256.2 MB | ########6 | 87%  2025-05-07T19:46:11.8320212Z nsight-compute-2024. | 443.1 MB | #####1 | 51% 2025-05-07T19:46:11.8320484Z 2025-05-07T19:46:11.8320488Z 2025-05-07T19:46:11.8320492Z 2025-05-07T19:46:11.8320496Z 2025-05-07T19:46:11.8320500Z 2025-05-07T19:46:11.8320714Z 2025-05-07T19:46:11.8809443Z libcusolver-11.7.1.2 | 95.8 MB | #7 | 17%  2025-05-07T19:46:11.8809789Z 2025-05-07T19:46:11.8809795Z 2025-05-07T19:46:11.8809883Z 2025-05-07T19:46:11.8809891Z 2025-05-07T19:46:11.8809906Z 2025-05-07T19:46:11.9323008Z cuda-nvvp-12.6.80 | 109.3 MB | ###9 | 39%  2025-05-07T19:46:11.9323329Z 2025-05-07T19:46:11.9323336Z 2025-05-07T19:46:11.9323560Z 2025-05-07T19:46:11.9323564Z 2025-05-07T19:46:11.9323568Z 2025-05-07T19:46:11.9323599Z 2025-05-07T19:46:11.9361845Z libcusolver-11.7.1.2 | 95.8 MB | ##3 | 23%  2025-05-07T19:46:11.9362840Z 2025-05-07T19:46:11.9519278Z libcublas-12.6.4.1 | 256.2 MB | ########9 | 89%  2025-05-07T19:46:11.9937576Z nsight-compute-2024. 
| 443.1 MB | #####2 | 53% 2025-05-07T19:46:11.9938088Z 2025-05-07T19:46:11.9938141Z 2025-05-07T19:46:11.9938147Z 2025-05-07T19:46:11.9938151Z 2025-05-07T19:46:11.9938155Z 2025-05-07T19:46:12.0324621Z cuda-nvvp-12.6.80 | 109.3 MB | ####5 | 45%  2025-05-07T19:46:12.0325535Z 2025-05-07T19:46:12.0325575Z 2025-05-07T19:46:12.0325587Z 2025-05-07T19:46:12.0325597Z 2025-05-07T19:46:12.0325608Z 2025-05-07T19:46:12.0325618Z 2025-05-07T19:46:12.0629007Z libcusolver-11.7.1.2 | 95.8 MB | ##9 | 29%  2025-05-07T19:46:12.0629474Z 2025-05-07T19:46:12.0666505Z libcublas-12.6.4.1 | 256.2 MB | #########1 | 92%  2025-05-07T19:46:12.1017163Z nsight-compute-2024. | 443.1 MB | #####4 | 54% 2025-05-07T19:46:12.1017985Z 2025-05-07T19:46:12.1017999Z 2025-05-07T19:46:12.1018009Z 2025-05-07T19:46:12.1018020Z 2025-05-07T19:46:12.1018031Z 2025-05-07T19:46:12.1326729Z cuda-nvvp-12.6.80 | 109.3 MB | #####1 | 51%  2025-05-07T19:46:12.1327053Z 2025-05-07T19:46:12.1327057Z 2025-05-07T19:46:12.1327061Z 2025-05-07T19:46:12.1327064Z 2025-05-07T19:46:12.1327068Z 2025-05-07T19:46:12.1327072Z 2025-05-07T19:46:12.1701432Z libcusolver-11.7.1.2 | 95.8 MB | ###5 | 35%  2025-05-07T19:46:12.1702781Z 2025-05-07T19:46:12.1888431Z libcublas-12.6.4.1 | 256.2 MB | #########3 | 94%  2025-05-07T19:46:12.2092199Z nsight-compute-2024. | 443.1 MB | #####5 | 56% 2025-05-07T19:46:12.2092719Z 2025-05-07T19:46:12.2092758Z 2025-05-07T19:46:12.2092763Z 2025-05-07T19:46:12.2092767Z 2025-05-07T19:46:12.2092796Z 2025-05-07T19:46:12.2327527Z cuda-nvvp-12.6.80 | 109.3 MB | #####6 | 57%  2025-05-07T19:46:12.2328249Z 2025-05-07T19:46:12.2328253Z 2025-05-07T19:46:12.2328257Z 2025-05-07T19:46:12.2328261Z 2025-05-07T19:46:12.2328264Z 2025-05-07T19:46:12.2328268Z 2025-05-07T19:46:12.2735334Z libcusolver-11.7.1.2 | 95.8 MB | #### | 41%  2025-05-07T19:46:12.2735681Z 2025-05-07T19:46:12.2945434Z libcublas-12.6.4.1 | 256.2 MB | #########5 | 96%  2025-05-07T19:46:12.3159973Z nsight-compute-2024. 
| 443.1 MB | #####7 | 57% 2025-05-07T19:46:12.3160484Z 2025-05-07T19:46:12.3160526Z 2025-05-07T19:46:12.3160531Z 2025-05-07T19:46:12.3160536Z 2025-05-07T19:46:12.3160543Z 2025-05-07T19:46:12.3438020Z cuda-nvvp-12.6.80 | 109.3 MB | ######2 | 62%  2025-05-07T19:46:12.3438354Z 2025-05-07T19:46:12.3438358Z 2025-05-07T19:46:12.3438362Z 2025-05-07T19:46:12.3438365Z 2025-05-07T19:46:12.3438369Z 2025-05-07T19:46:12.3438372Z 2025-05-07T19:46:12.3746201Z libcusolver-11.7.1.2 | 95.8 MB | ####6 | 47%  2025-05-07T19:46:12.3746580Z 2025-05-07T19:46:12.3946601Z libcublas-12.6.4.1 | 256.2 MB | #########7 | 98%  2025-05-07T19:46:12.4160759Z nsight-compute-2024. | 443.1 MB | #####8 | 58% 2025-05-07T19:46:12.4161443Z 2025-05-07T19:46:12.4161468Z 2025-05-07T19:46:12.4161484Z 2025-05-07T19:46:12.4161554Z 2025-05-07T19:46:12.4161584Z 2025-05-07T19:46:12.4747663Z cuda-nvvp-12.6.80 | 109.3 MB | ######8 | 68%  2025-05-07T19:46:12.4747988Z 2025-05-07T19:46:12.4947662Z libcublas-12.6.4.1 | 256.2 MB | #########9 | 100%  2025-05-07T19:46:12.4973446Z nsight-compute-2024. | 443.1 MB | #####9 | 60% 2025-05-07T19:46:12.4973762Z 2025-05-07T19:46:12.4973804Z 2025-05-07T19:46:12.4973811Z 2025-05-07T19:46:12.4973892Z 2025-05-07T19:46:12.4973901Z 2025-05-07T19:46:12.4973907Z 2025-05-07T19:46:12.5163721Z libcusolver-11.7.1.2 | 95.8 MB | #####2 | 52%  2025-05-07T19:46:12.5164057Z 2025-05-07T19:46:12.5164279Z 2025-05-07T19:46:12.5164284Z 2025-05-07T19:46:12.5164287Z 2025-05-07T19:46:12.5164299Z 2025-05-07T19:46:12.5976720Z cuda-nvvp-12.6.80 | 109.3 MB | #######4 | 75%  2025-05-07T19:46:12.5977162Z 2025-05-07T19:46:12.5977167Z 2025-05-07T19:46:12.5977171Z 2025-05-07T19:46:12.5977175Z 2025-05-07T19:46:12.5977178Z 2025-05-07T19:46:12.5977182Z 2025-05-07T19:46:12.6006119Z libcusolver-11.7.1.2 | 95.8 MB | #####7 | 57%  2025-05-07T19:46:12.6292306Z nsight-compute-2024. 
| 443.1 MB | ######1 | 62% 2025-05-07T19:46:12.6293137Z 2025-05-07T19:46:12.6293151Z 2025-05-07T19:46:12.6293162Z 2025-05-07T19:46:12.6293173Z 2025-05-07T19:46:12.6628905Z cuda-nsight-12.6.77 | 113.2 MB | ########## | 100%  2025-05-07T19:46:12.6629231Z 2025-05-07T19:46:12.6629236Z 2025-05-07T19:46:12.6629239Z 2025-05-07T19:46:12.6629243Z 2025-05-07T19:46:12.6629247Z 2025-05-07T19:46:12.6977183Z cuda-nvvp-12.6.80 | 109.3 MB | ######## | 81%  2025-05-07T19:46:12.6977645Z 2025-05-07T19:46:12.6977729Z 2025-05-07T19:46:12.6977737Z 2025-05-07T19:46:12.6977753Z 2025-05-07T19:46:12.6977757Z 2025-05-07T19:46:12.6977786Z 2025-05-07T19:46:12.7007974Z libcusolver-11.7.1.2 | 95.8 MB | ######6 | 67%  2025-05-07T19:46:12.7661365Z nsight-compute-2024. | 443.1 MB | ######3 | 63% 2025-05-07T19:46:12.7661684Z 2025-05-07T19:46:12.7661689Z 2025-05-07T19:46:12.7661694Z 2025-05-07T19:46:12.7661699Z 2025-05-07T19:46:12.7661703Z 2025-05-07T19:46:12.8034122Z cuda-nvvp-12.6.80 | 109.3 MB | ########6 | 87%  2025-05-07T19:46:12.8531770Z nsight-compute-2024. | 443.1 MB | ######5 | 66% 2025-05-07T19:46:12.8532142Z 2025-05-07T19:46:12.8532263Z 2025-05-07T19:46:12.8532267Z 2025-05-07T19:46:12.8532289Z 2025-05-07T19:46:12.8532328Z 2025-05-07T19:46:12.8532331Z 2025-05-07T19:46:12.8660857Z libcusolver-11.7.1.2 | 95.8 MB | #######3 | 73%  2025-05-07T19:46:12.8661312Z 2025-05-07T19:46:12.8661442Z 2025-05-07T19:46:12.8661446Z 2025-05-07T19:46:12.8661450Z 2025-05-07T19:46:12.9359635Z 2025-05-07T19:46:12.9360067Z cuda-nvvp-12.6.80 | 109.3 MB | #########4 | 94%  2025-05-07T19:46:12.9531968Z nsight-compute-2024. | 443.1 MB | ######7 | 67% 2025-05-07T19:46:12.9532352Z 2025-05-07T19:46:12.9532557Z 2025-05-07T19:46:12.9532563Z 2025-05-07T19:46:12.9532594Z 2025-05-07T19:46:12.9532612Z 2025-05-07T19:46:12.9532616Z 2025-05-07T19:46:13.0450167Z libcusolver-11.7.1.2 | 95.8 MB | ########2 | 82%  2025-05-07T19:46:13.0532239Z nsight-compute-2024. 
| 443.1 MB | ######8 | 69% 2025-05-07T19:46:13.0532632Z 2025-05-07T19:46:13.0532935Z 2025-05-07T19:46:13.0532943Z 2025-05-07T19:46:13.0532948Z 2025-05-07T19:46:13.0532953Z 2025-05-07T19:46:13.0533000Z 2025-05-07T19:46:13.1579189Z libcusolver-11.7.1.2 | 95.8 MB | #########1 | 91%  2025-05-07T19:46:13.1579555Z 2025-05-07T19:46:13.1579561Z 2025-05-07T19:46:13.2070329Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:13.2298245Z nsight-compute-2024. | 443.1 MB | ####### | 70% 2025-05-07T19:46:13.2298652Z 2025-05-07T19:46:13.2298771Z 2025-05-07T19:46:13.2298775Z 2025-05-07T19:46:13.2299000Z 2025-05-07T19:46:13.2299007Z 2025-05-07T19:46:13.2299050Z 2025-05-07T19:46:13.2299055Z 2025-05-07T19:46:13.3096089Z libnpp-12.3.1.54 | 93.4 MB | | 0%  2025-05-07T19:46:13.3310010Z nsight-compute-2024. | 443.1 MB | #######2 | 72% 2025-05-07T19:46:13.3310334Z 2025-05-07T19:46:13.3310564Z 2025-05-07T19:46:13.3310567Z 2025-05-07T19:46:13.3310571Z 2025-05-07T19:46:13.3310574Z 2025-05-07T19:46:13.3310578Z 2025-05-07T19:46:13.3310582Z 2025-05-07T19:46:13.4096245Z libnpp-12.3.1.54 | 93.4 MB | 4 | 4%  2025-05-07T19:46:13.4311235Z nsight-compute-2024. | 443.1 MB | #######3 | 74% 2025-05-07T19:46:13.4311536Z 2025-05-07T19:46:13.4311541Z 2025-05-07T19:46:13.4311545Z 2025-05-07T19:46:13.4311767Z 2025-05-07T19:46:13.4311772Z 2025-05-07T19:46:13.4311775Z 2025-05-07T19:46:13.4311779Z 2025-05-07T19:46:13.5115554Z libnpp-12.3.1.54 | 93.4 MB | #2 | 13%  2025-05-07T19:46:13.5398798Z nsight-compute-2024. | 443.1 MB | #######5 | 76% 2025-05-07T19:46:13.5399221Z 2025-05-07T19:46:13.5399342Z 2025-05-07T19:46:13.5399346Z 2025-05-07T19:46:13.5399368Z 2025-05-07T19:46:13.5399613Z 2025-05-07T19:46:13.5399621Z 2025-05-07T19:46:13.5399626Z 2025-05-07T19:46:13.6116505Z libnpp-12.3.1.54 | 93.4 MB | #8 | 18%  2025-05-07T19:46:13.6402454Z nsight-compute-2024. 
| 443.1 MB | #######7 | 78% 2025-05-07T19:46:13.6402840Z 2025-05-07T19:46:13.6403449Z 2025-05-07T19:46:13.6403573Z 2025-05-07T19:46:13.6403578Z 2025-05-07T19:46:13.6403581Z 2025-05-07T19:46:13.6403585Z 2025-05-07T19:46:13.6403589Z 2025-05-07T19:46:13.7117849Z libnpp-12.3.1.54 | 93.4 MB | ##3 | 24%  2025-05-07T19:46:13.7641326Z nsight-compute-2024. | 443.1 MB | #######9 | 80% 2025-05-07T19:46:13.7641776Z 2025-05-07T19:46:13.7641933Z 2025-05-07T19:46:13.7641942Z 2025-05-07T19:46:13.7641947Z 2025-05-07T19:46:13.7641988Z 2025-05-07T19:46:13.7641992Z 2025-05-07T19:46:13.7642021Z 2025-05-07T19:46:13.8117788Z libnpp-12.3.1.54 | 93.4 MB | ##8 | 29%  2025-05-07T19:46:13.8174211Z nsight-compute-2024. | 443.1 MB | ########1 | 82% 2025-05-07T19:46:13.8175047Z 2025-05-07T19:46:13.8175059Z 2025-05-07T19:46:13.8175070Z 2025-05-07T19:46:13.8175117Z 2025-05-07T19:46:13.8175128Z 2025-05-07T19:46:13.8520774Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:13.8521160Z 2025-05-07T19:46:13.8521166Z 2025-05-07T19:46:13.8521172Z 2025-05-07T19:46:13.8521179Z 2025-05-07T19:46:13.8521185Z 2025-05-07T19:46:13.8521191Z 2025-05-07T19:46:13.8521195Z 2025-05-07T19:46:13.8521208Z 2025-05-07T19:46:13.8640857Z cuda-nvdisasm-12.6.7 | 47.6 MB | | 0%  2025-05-07T19:46:13.8641210Z 2025-05-07T19:46:13.8641214Z 2025-05-07T19:46:13.8641218Z 2025-05-07T19:46:13.8641221Z 2025-05-07T19:46:13.8641225Z 2025-05-07T19:46:13.8641229Z 2025-05-07T19:46:13.8641232Z 2025-05-07T19:46:13.9522433Z libnpp-12.3.1.54 | 93.4 MB | ###6 | 37%  2025-05-07T19:46:13.9522911Z 2025-05-07T19:46:13.9523035Z 2025-05-07T19:46:13.9523039Z 2025-05-07T19:46:13.9523072Z 2025-05-07T19:46:13.9523077Z 2025-05-07T19:46:13.9523082Z 2025-05-07T19:46:13.9523132Z 2025-05-07T19:46:13.9523155Z 2025-05-07T19:46:13.9640941Z cuda-nvdisasm-12.6.7 | 47.6 MB | #5 | 15%  2025-05-07T19:46:13.9641398Z 2025-05-07T19:46:13.9641542Z 2025-05-07T19:46:13.9641547Z 2025-05-07T19:46:13.9641576Z 2025-05-07T19:46:13.9641594Z 
2025-05-07T19:46:13.9641598Z 2025-05-07T19:46:13.9641634Z 2025-05-07T19:46:14.0523880Z libnpp-12.3.1.54 | 93.4 MB | ####4 | 44%  2025-05-07T19:46:14.0524380Z 2025-05-07T19:46:14.0524434Z 2025-05-07T19:46:14.0524440Z 2025-05-07T19:46:14.0524444Z 2025-05-07T19:46:14.0524549Z 2025-05-07T19:46:14.0524555Z 2025-05-07T19:46:14.0524559Z 2025-05-07T19:46:14.0524629Z 2025-05-07T19:46:14.0640883Z cuda-nvdisasm-12.6.7 | 47.6 MB | ###1 | 31%  2025-05-07T19:46:14.0641517Z 2025-05-07T19:46:14.0641586Z 2025-05-07T19:46:14.0641591Z 2025-05-07T19:46:14.0641641Z 2025-05-07T19:46:14.0641645Z 2025-05-07T19:46:14.0641650Z 2025-05-07T19:46:14.0641654Z 2025-05-07T19:46:14.0922588Z libnpp-12.3.1.54 | 93.4 MB | #####1 | 52%  2025-05-07T19:46:14.0923161Z 2025-05-07T19:46:14.0923398Z 2025-05-07T19:46:14.0923406Z 2025-05-07T19:46:14.0923411Z 2025-05-07T19:46:14.0923416Z 2025-05-07T19:46:14.0923420Z 2025-05-07T19:46:14.0923930Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:14.0924248Z 2025-05-07T19:46:14.0924252Z 2025-05-07T19:46:14.0924474Z 2025-05-07T19:46:14.0924509Z 2025-05-07T19:46:14.0924513Z 2025-05-07T19:46:14.0924516Z 2025-05-07T19:46:14.1240893Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:14.1241230Z 2025-05-07T19:46:14.1241265Z 2025-05-07T19:46:14.1241269Z 2025-05-07T19:46:14.1294097Z libcusparse-12.5.4.2 | 118.6 MB | ########## | 100%  2025-05-07T19:46:14.1523083Z nsight-compute-2024. 
| 443.1 MB | ########3 | 84% 2025-05-07T19:46:14.1523480Z 2025-05-07T19:46:14.1523673Z 2025-05-07T19:46:14.1523676Z 2025-05-07T19:46:14.1523681Z 2025-05-07T19:46:14.1523725Z 2025-05-07T19:46:14.1523754Z 2025-05-07T19:46:14.1523758Z 2025-05-07T19:46:14.1523761Z 2025-05-07T19:46:14.1617630Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####4 | 45%  2025-05-07T19:46:14.1617989Z 2025-05-07T19:46:14.1617994Z 2025-05-07T19:46:14.1617999Z 2025-05-07T19:46:14.1618002Z 2025-05-07T19:46:14.1618006Z 2025-05-07T19:46:14.1618010Z 2025-05-07T19:46:14.1618015Z 2025-05-07T19:46:14.1618031Z 2025-05-07T19:46:14.1618043Z 2025-05-07T19:46:14.2013988Z libcurand-10.3.7.77 | 39.9 MB | | 0%  2025-05-07T19:46:14.2014947Z 2025-05-07T19:46:14.2014961Z 2025-05-07T19:46:14.2014972Z 2025-05-07T19:46:14.2014983Z 2025-05-07T19:46:14.2014993Z 2025-05-07T19:46:14.2015004Z 2025-05-07T19:46:14.2015014Z 2025-05-07T19:46:14.2385046Z libnpp-12.3.1.54 | 93.4 MB | #####8 | 58%  2025-05-07T19:46:14.2523156Z nsight-compute-2024. | 443.1 MB | ########5 | 85% 2025-05-07T19:46:14.2523584Z 2025-05-07T19:46:14.2523708Z 2025-05-07T19:46:14.2523712Z 2025-05-07T19:46:14.2523716Z 2025-05-07T19:46:14.2523822Z 2025-05-07T19:46:14.2523846Z 2025-05-07T19:46:14.2523850Z 2025-05-07T19:46:14.2523908Z 2025-05-07T19:46:14.2618522Z cuda-nvdisasm-12.6.7 | 47.6 MB | #####7 | 58%  2025-05-07T19:46:14.2619013Z 2025-05-07T19:46:14.2619194Z 2025-05-07T19:46:14.2619202Z 2025-05-07T19:46:14.2619224Z 2025-05-07T19:46:14.2619261Z 2025-05-07T19:46:14.2619266Z 2025-05-07T19:46:14.2619270Z 2025-05-07T19:46:14.2619301Z 2025-05-07T19:46:14.2619305Z 2025-05-07T19:46:14.3222564Z libcurand-10.3.7.77 | 39.9 MB | #3 | 13%  2025-05-07T19:46:14.3222898Z 2025-05-07T19:46:14.3222916Z 2025-05-07T19:46:14.3222919Z 2025-05-07T19:46:14.3222923Z 2025-05-07T19:46:14.3222927Z 2025-05-07T19:46:14.3222931Z 2025-05-07T19:46:14.3222934Z 2025-05-07T19:46:14.3619926Z libnpp-12.3.1.54 | 93.4 MB | ######4 | 64%  2025-05-07T19:46:14.3620359Z 2025-05-07T19:46:14.3620462Z 
2025-05-07T19:46:14.3620473Z 2025-05-07T19:46:14.3620480Z 2025-05-07T19:46:14.3620487Z 2025-05-07T19:46:14.3620494Z 2025-05-07T19:46:14.3620501Z 2025-05-07T19:46:14.3620507Z 2025-05-07T19:46:14.3620524Z 2025-05-07T19:46:14.3624473Z libcurand-10.3.7.77 | 39.9 MB | ##7 | 27%  2025-05-07T19:46:14.3624788Z 2025-05-07T19:46:14.3624791Z 2025-05-07T19:46:14.3624820Z 2025-05-07T19:46:14.3624824Z 2025-05-07T19:46:14.3624828Z 2025-05-07T19:46:14.3624832Z 2025-05-07T19:46:14.3624835Z 2025-05-07T19:46:14.3624839Z 2025-05-07T19:46:14.3724986Z cuda-nvdisasm-12.6.7 | 47.6 MB | ####### | 71%  2025-05-07T19:46:14.4361442Z nsight-compute-2024. | 443.1 MB | ########6 | 87% 2025-05-07T19:46:14.4362006Z 2025-05-07T19:46:14.4362303Z 2025-05-07T19:46:14.4362310Z 2025-05-07T19:46:14.4362316Z 2025-05-07T19:46:14.4362320Z 2025-05-07T19:46:14.4362355Z 2025-05-07T19:46:14.4362360Z 2025-05-07T19:46:14.4621357Z libnpp-12.3.1.54 | 93.4 MB | ####### | 70%  2025-05-07T19:46:14.4622270Z 2025-05-07T19:46:14.4622284Z 2025-05-07T19:46:14.4622295Z 2025-05-07T19:46:14.4622306Z 2025-05-07T19:46:14.4622316Z 2025-05-07T19:46:14.4622344Z 2025-05-07T19:46:14.4622355Z 2025-05-07T19:46:14.4622366Z 2025-05-07T19:46:14.4622376Z 2025-05-07T19:46:14.4764154Z libcurand-10.3.7.77 | 39.9 MB | ####1 | 41%  2025-05-07T19:46:14.4765128Z 2025-05-07T19:46:14.4765141Z 2025-05-07T19:46:14.4765151Z 2025-05-07T19:46:14.4765162Z 2025-05-07T19:46:14.4765192Z 2025-05-07T19:46:14.4765203Z 2025-05-07T19:46:14.4765214Z 2025-05-07T19:46:14.4765224Z 2025-05-07T19:46:14.5014541Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########3 | 83%  2025-05-07T19:46:14.5569691Z nsight-compute-2024. 
| 443.1 MB | ########8 | 88% 2025-05-07T19:46:14.5570015Z 2025-05-07T19:46:14.5570020Z 2025-05-07T19:46:14.5570025Z 2025-05-07T19:46:14.5570030Z 2025-05-07T19:46:14.5570056Z 2025-05-07T19:46:14.5570061Z 2025-05-07T19:46:14.5570066Z 2025-05-07T19:46:14.5622855Z libnpp-12.3.1.54 | 93.4 MB | #######5 | 76%  2025-05-07T19:46:14.5623766Z 2025-05-07T19:46:14.5623780Z 2025-05-07T19:46:14.5623791Z 2025-05-07T19:46:14.5623801Z 2025-05-07T19:46:14.5623812Z 2025-05-07T19:46:14.5623823Z 2025-05-07T19:46:14.5623833Z 2025-05-07T19:46:14.5623843Z 2025-05-07T19:46:14.5623884Z 2025-05-07T19:46:14.5826703Z libcurand-10.3.7.77 | 39.9 MB | #####4 | 54%  2025-05-07T19:46:14.5827840Z 2025-05-07T19:46:14.5827853Z 2025-05-07T19:46:14.5827864Z 2025-05-07T19:46:14.5827874Z 2025-05-07T19:46:14.5827885Z 2025-05-07T19:46:14.5827916Z 2025-05-07T19:46:14.5827926Z 2025-05-07T19:46:14.5827937Z 2025-05-07T19:46:14.6206718Z cuda-nvdisasm-12.6.7 | 47.6 MB | #########5 | 95%  2025-05-07T19:46:14.6586005Z nsight-compute-2024. | 443.1 MB | ########9 | 90% 2025-05-07T19:46:14.6586296Z 2025-05-07T19:46:14.6586300Z 2025-05-07T19:46:14.6586303Z 2025-05-07T19:46:14.6586307Z 2025-05-07T19:46:14.6586311Z 2025-05-07T19:46:14.6586314Z 2025-05-07T19:46:14.6586318Z 2025-05-07T19:46:14.6621959Z libnpp-12.3.1.54 | 93.4 MB | ########1 | 81%  2025-05-07T19:46:14.6622317Z 2025-05-07T19:46:14.6622333Z 2025-05-07T19:46:14.6622336Z 2025-05-07T19:46:14.6622340Z 2025-05-07T19:46:14.6622356Z 2025-05-07T19:46:14.6622359Z 2025-05-07T19:46:14.6622363Z 2025-05-07T19:46:14.6622366Z 2025-05-07T19:46:14.6624388Z 2025-05-07T19:46:14.7208877Z libcurand-10.3.7.77 | 39.9 MB | ######8 | 69%  2025-05-07T19:46:14.7587066Z nsight-compute-2024. 
| 443.1 MB | ######### | 91% 2025-05-07T19:46:14.7587343Z 2025-05-07T19:46:14.7587360Z 2025-05-07T19:46:14.7587364Z 2025-05-07T19:46:14.7587368Z 2025-05-07T19:46:14.7587374Z 2025-05-07T19:46:14.7587379Z 2025-05-07T19:46:14.7587383Z 2025-05-07T19:46:14.7622224Z libnpp-12.3.1.54 | 93.4 MB | ########6 | 87%  2025-05-07T19:46:14.7622588Z 2025-05-07T19:46:14.7622605Z 2025-05-07T19:46:14.7622609Z 2025-05-07T19:46:14.7622627Z 2025-05-07T19:46:14.7622631Z 2025-05-07T19:46:14.7622634Z 2025-05-07T19:46:14.7622637Z 2025-05-07T19:46:14.7622641Z 2025-05-07T19:46:14.7622644Z 2025-05-07T19:46:14.8212701Z libcurand-10.3.7.77 | 39.9 MB | ########4 | 85%  2025-05-07T19:46:14.9213759Z nsight-compute-2024. | 443.1 MB | #########2 | 92% 2025-05-07T19:46:14.9277603Z nsight-compute-2024. | 443.1 MB | #########4 | 94% 2025-05-07T19:46:14.9277927Z 2025-05-07T19:46:14.9277932Z 2025-05-07T19:46:14.9277935Z 2025-05-07T19:46:14.9277948Z 2025-05-07T19:46:14.9277952Z 2025-05-07T19:46:14.9277955Z 2025-05-07T19:46:14.9278226Z 2025-05-07T19:46:15.0214055Z libnpp-12.3.1.54 | 93.4 MB | #########2 | 92%  2025-05-07T19:46:15.0338051Z nsight-compute-2024. | 443.1 MB | #########6 | 97% 2025-05-07T19:46:15.0339137Z 2025-05-07T19:46:15.0339591Z 2025-05-07T19:46:15.0339628Z 2025-05-07T19:46:15.0339638Z 2025-05-07T19:46:15.0339648Z 2025-05-07T19:46:15.0339659Z 2025-05-07T19:46:15.0339669Z 2025-05-07T19:46:15.1213870Z libnpp-12.3.1.54 | 93.4 MB | #########6 | 97%  2025-05-07T19:46:15.2942225Z nsight-compute-2024. 
| 443.1 MB | #########8 | 99% 2025-05-07T19:46:15.2942503Z 2025-05-07T19:46:15.2942508Z 2025-05-07T19:46:15.2942684Z 2025-05-07T19:46:15.2942688Z 2025-05-07T19:46:15.2942692Z 2025-05-07T19:46:15.2942695Z 2025-05-07T19:46:15.2942699Z 2025-05-07T19:46:15.2942702Z 2025-05-07T19:46:15.3467229Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:15.3467589Z 2025-05-07T19:46:15.3467594Z 2025-05-07T19:46:15.3467597Z 2025-05-07T19:46:15.3467602Z 2025-05-07T19:46:15.3467607Z 2025-05-07T19:46:15.3467612Z 2025-05-07T19:46:15.3467617Z 2025-05-07T19:46:15.3467648Z 2025-05-07T19:46:15.3467651Z 2025-05-07T19:46:15.3467950Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:15.3468499Z 2025-05-07T19:46:15.3468506Z 2025-05-07T19:46:15.3468512Z 2025-05-07T19:46:15.3468519Z 2025-05-07T19:46:15.3468525Z 2025-05-07T19:46:15.3468531Z 2025-05-07T19:46:15.3468537Z 2025-05-07T19:46:15.3468558Z 2025-05-07T19:46:15.3468565Z 2025-05-07T19:46:15.3492347Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:15.3493551Z 2025-05-07T19:46:15.3493599Z 2025-05-07T19:46:15.3493622Z 2025-05-07T19:46:15.3493638Z 2025-05-07T19:46:15.3493649Z 2025-05-07T19:46:15.3493659Z 2025-05-07T19:46:15.3493690Z 2025-05-07T19:46:15.3493700Z 2025-05-07T19:46:15.3493710Z 2025-05-07T19:46:15.3493720Z 2025-05-07T19:46:15.4097291Z gds-tools-1.11.1.6 | 37.8 MB | | 0%  2025-05-07T19:46:15.4097697Z 2025-05-07T19:46:15.4097706Z 2025-05-07T19:46:15.4097713Z 2025-05-07T19:46:15.4097734Z 2025-05-07T19:46:15.4097740Z 2025-05-07T19:46:15.4097763Z 2025-05-07T19:46:15.4097770Z 2025-05-07T19:46:15.4097776Z 2025-05-07T19:46:15.4097782Z 2025-05-07T19:46:15.4097789Z 2025-05-07T19:46:15.4097795Z 2025-05-07T19:46:15.4494190Z cuda-nvcc-tools-12.6 | 23.0 MB | | 0%  2025-05-07T19:46:15.4494570Z 2025-05-07T19:46:15.4494575Z 2025-05-07T19:46:15.4494578Z 2025-05-07T19:46:15.4494582Z 2025-05-07T19:46:15.4494585Z 2025-05-07T19:46:15.4494607Z 2025-05-07T19:46:15.4494610Z 2025-05-07T19:46:15.4494613Z 
2025-05-07T19:46:15.4494617Z 2025-05-07T19:46:15.4494620Z 2025-05-07T19:46:15.5091087Z gds-tools-1.11.1.6 | 37.8 MB | ##5 | 25%  2025-05-07T19:46:15.5091549Z 2025-05-07T19:46:15.5095790Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:15.5096051Z 2025-05-07T19:46:15.5096055Z 2025-05-07T19:46:15.5096075Z 2025-05-07T19:46:15.5096079Z 2025-05-07T19:46:15.5096082Z 2025-05-07T19:46:15.5096085Z 2025-05-07T19:46:15.5096089Z 2025-05-07T19:46:15.5096117Z 2025-05-07T19:46:15.5096121Z 2025-05-07T19:46:15.5096124Z 2025-05-07T19:46:15.5096128Z 2025-05-07T19:46:15.5495573Z cuda-nvcc-tools-12.6 | 23.0 MB | ###2 | 33%  2025-05-07T19:46:15.5496603Z 2025-05-07T19:46:15.5496616Z 2025-05-07T19:46:15.5496628Z 2025-05-07T19:46:15.5496638Z 2025-05-07T19:46:15.5496649Z 2025-05-07T19:46:15.5496659Z 2025-05-07T19:46:15.5496698Z 2025-05-07T19:46:15.5496710Z 2025-05-07T19:46:15.5496720Z 2025-05-07T19:46:15.5496730Z 2025-05-07T19:46:15.5688217Z gds-tools-1.11.1.6 | 37.8 MB | ####9 | 49%  2025-05-07T19:46:15.5688694Z 2025-05-07T19:46:15.5688809Z 2025-05-07T19:46:15.5688837Z 2025-05-07T19:46:15.5688841Z 2025-05-07T19:46:15.5688877Z 2025-05-07T19:46:15.5688880Z 2025-05-07T19:46:15.5688968Z 2025-05-07T19:46:15.5688972Z 2025-05-07T19:46:15.5688975Z 2025-05-07T19:46:15.5688979Z 2025-05-07T19:46:15.5688982Z 2025-05-07T19:46:15.5688986Z 2025-05-07T19:46:15.6101663Z cuda-nvrtc-12.6.85 | 17.3 MB | | 0%  2025-05-07T19:46:15.6102013Z 2025-05-07T19:46:15.6102019Z 2025-05-07T19:46:15.6102023Z 2025-05-07T19:46:15.6102028Z 2025-05-07T19:46:15.6102031Z 2025-05-07T19:46:15.6102035Z 2025-05-07T19:46:15.6102038Z 2025-05-07T19:46:15.6102042Z 2025-05-07T19:46:15.6102045Z 2025-05-07T19:46:15.6102048Z 2025-05-07T19:46:15.6102052Z 2025-05-07T19:46:15.6561910Z cuda-nvcc-tools-12.6 | 23.0 MB | #####8 | 59%  2025-05-07T19:46:15.6562300Z 2025-05-07T19:46:15.6562305Z 2025-05-07T19:46:15.6562309Z 2025-05-07T19:46:15.6562313Z 2025-05-07T19:46:15.6562317Z 2025-05-07T19:46:15.6562320Z 
2025-05-07T19:46:15.6562324Z 2025-05-07T19:46:15.6562340Z 2025-05-07T19:46:15.6562344Z 2025-05-07T19:46:15.6562347Z 2025-05-07T19:46:15.6688111Z gds-tools-1.11.1.6 | 37.8 MB | ######8 | 68%  2025-05-07T19:46:15.6688556Z 2025-05-07T19:46:15.6688642Z 2025-05-07T19:46:15.6688663Z 2025-05-07T19:46:15.6688730Z 2025-05-07T19:46:15.6688736Z 2025-05-07T19:46:15.6688740Z 2025-05-07T19:46:15.6688775Z 2025-05-07T19:46:15.6688778Z 2025-05-07T19:46:15.6688782Z 2025-05-07T19:46:15.6688818Z 2025-05-07T19:46:15.6688821Z 2025-05-07T19:46:15.6688825Z 2025-05-07T19:46:15.7102674Z cuda-nvrtc-12.6.85 | 17.3 MB | ###1 | 31%  2025-05-07T19:46:15.7103058Z 2025-05-07T19:46:15.7103065Z 2025-05-07T19:46:15.7103071Z 2025-05-07T19:46:15.7103077Z 2025-05-07T19:46:15.7103083Z 2025-05-07T19:46:15.7103089Z 2025-05-07T19:46:15.7103094Z 2025-05-07T19:46:15.7103101Z 2025-05-07T19:46:15.7103106Z 2025-05-07T19:46:15.7103113Z 2025-05-07T19:46:15.7103118Z 2025-05-07T19:46:15.7692385Z cuda-nvcc-tools-12.6 | 23.0 MB | ########5 | 86%  2025-05-07T19:46:15.7692939Z 2025-05-07T19:46:15.7693239Z 2025-05-07T19:46:15.7693252Z 2025-05-07T19:46:15.7693259Z 2025-05-07T19:46:15.7693265Z 2025-05-07T19:46:15.7693309Z 2025-05-07T19:46:15.7693314Z 2025-05-07T19:46:15.7693318Z 2025-05-07T19:46:15.7693323Z 2025-05-07T19:46:15.7693328Z 2025-05-07T19:46:15.7693333Z 2025-05-07T19:46:15.7693338Z 2025-05-07T19:46:15.7756978Z cuda-nvrtc-12.6.85 | 17.3 MB | ######4 | 64%  2025-05-07T19:46:15.7757357Z 2025-05-07T19:46:15.7757534Z 2025-05-07T19:46:15.7757538Z 2025-05-07T19:46:15.7757616Z 2025-05-07T19:46:15.7757647Z 2025-05-07T19:46:15.7757653Z 2025-05-07T19:46:15.7757659Z 2025-05-07T19:46:15.7757665Z 2025-05-07T19:46:15.7757701Z 2025-05-07T19:46:15.7757706Z 2025-05-07T19:46:15.9972456Z gds-tools-1.11.1.6 | 37.8 MB | ########6 | 87%  2025-05-07T19:46:15.9972986Z 2025-05-07T19:46:15.9973049Z 2025-05-07T19:46:16.0376134Z libcufft-11.3.0.4 | 156.2 MB | ########## | 100%  2025-05-07T19:46:16.0376554Z 2025-05-07T19:46:16.0376564Z 
2025-05-07T19:46:16.0376572Z 2025-05-07T19:46:16.0376578Z 2025-05-07T19:46:16.0376584Z 2025-05-07T19:46:16.0376652Z 2025-05-07T19:46:16.0376656Z 2025-05-07T19:46:16.0376660Z 2025-05-07T19:46:16.0376664Z 2025-05-07T19:46:16.0376668Z 2025-05-07T19:46:16.0376671Z 2025-05-07T19:46:16.0376675Z 2025-05-07T19:46:16.0377001Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:16.0377364Z 2025-05-07T19:46:16.0377367Z 2025-05-07T19:46:16.0377372Z 2025-05-07T19:46:16.0377393Z 2025-05-07T19:46:16.0377396Z 2025-05-07T19:46:16.0377400Z 2025-05-07T19:46:16.0377404Z 2025-05-07T19:46:16.0377407Z 2025-05-07T19:46:16.0377410Z 2025-05-07T19:46:16.0377414Z 2025-05-07T19:46:16.0377417Z 2025-05-07T19:46:16.0377421Z 2025-05-07T19:46:16.0609701Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:16.0610078Z 2025-05-07T19:46:16.0610084Z 2025-05-07T19:46:16.0610089Z 2025-05-07T19:46:16.0610093Z 2025-05-07T19:46:16.0610098Z 2025-05-07T19:46:16.0610103Z 2025-05-07T19:46:16.0610108Z 2025-05-07T19:46:16.0610384Z 2025-05-07T19:46:16.0610388Z 2025-05-07T19:46:16.0610394Z 2025-05-07T19:46:16.0610398Z 2025-05-07T19:46:16.0822475Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:16.0822856Z 2025-05-07T19:46:16.0822862Z 2025-05-07T19:46:16.0822866Z 2025-05-07T19:46:16.0822869Z 2025-05-07T19:46:16.0822874Z 2025-05-07T19:46:16.0822877Z 2025-05-07T19:46:16.0823123Z 2025-05-07T19:46:16.0823156Z 2025-05-07T19:46:16.0823159Z 2025-05-07T19:46:16.0823163Z 2025-05-07T19:46:16.0823167Z 2025-05-07T19:46:16.0823170Z 2025-05-07T19:46:16.0823175Z 2025-05-07T19:46:16.1142545Z libnvjitlink-12.6.85 | 14.9 MB | | 0%  2025-05-07T19:46:16.1142937Z 2025-05-07T19:46:16.1142942Z 2025-05-07T19:46:16.1142976Z 2025-05-07T19:46:16.1142982Z 2025-05-07T19:46:16.1142988Z 2025-05-07T19:46:16.1142994Z 2025-05-07T19:46:16.1142999Z 2025-05-07T19:46:16.1143004Z 2025-05-07T19:46:16.1143009Z 2025-05-07T19:46:16.1143046Z 2025-05-07T19:46:16.1143049Z 2025-05-07T19:46:16.1143053Z 
2025-05-07T19:46:16.1143056Z 2025-05-07T19:46:16.1143060Z 2025-05-07T19:46:16.1824076Z cuda-nvcc-dev_linux- | 10.8 MB | | 0%  2025-05-07T19:46:16.1824434Z 2025-05-07T19:46:16.1824471Z 2025-05-07T19:46:16.1824476Z 2025-05-07T19:46:16.1824481Z 2025-05-07T19:46:16.1824486Z 2025-05-07T19:46:16.1824517Z 2025-05-07T19:46:16.1824522Z 2025-05-07T19:46:16.1824529Z 2025-05-07T19:46:16.1824534Z 2025-05-07T19:46:16.1824539Z 2025-05-07T19:46:16.1824544Z 2025-05-07T19:46:16.1824548Z 2025-05-07T19:46:16.1824552Z 2025-05-07T19:46:16.2141921Z libnvjitlink-12.6.85 | 14.9 MB | #####5 | 55%  2025-05-07T19:46:16.2142297Z 2025-05-07T19:46:16.2142302Z 2025-05-07T19:46:16.2142306Z 2025-05-07T19:46:16.2142310Z 2025-05-07T19:46:16.2142314Z 2025-05-07T19:46:16.2142318Z 2025-05-07T19:46:16.2142321Z 2025-05-07T19:46:16.2142326Z 2025-05-07T19:46:16.2142351Z 2025-05-07T19:46:16.2142355Z 2025-05-07T19:46:16.2142358Z 2025-05-07T19:46:16.2142362Z 2025-05-07T19:46:16.2142365Z 2025-05-07T19:46:16.2142369Z 2025-05-07T19:46:16.3279847Z cuda-nvcc-dev_linux- | 10.8 MB | ######5 | 66%  2025-05-07T19:46:16.3280219Z 2025-05-07T19:46:16.3280224Z 2025-05-07T19:46:16.3280227Z 2025-05-07T19:46:16.3280231Z 2025-05-07T19:46:16.3280255Z 2025-05-07T19:46:16.3280259Z 2025-05-07T19:46:16.3280262Z 2025-05-07T19:46:16.3280266Z 2025-05-07T19:46:16.3280270Z 2025-05-07T19:46:16.3280273Z 2025-05-07T19:46:16.3493970Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  2025-05-07T19:46:16.3494384Z 2025-05-07T19:46:16.3494646Z 2025-05-07T19:46:16.3494657Z 2025-05-07T19:46:16.3494662Z 2025-05-07T19:46:16.3494667Z 2025-05-07T19:46:16.3494671Z 2025-05-07T19:46:16.3494676Z 2025-05-07T19:46:16.3711364Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  2025-05-07T19:46:16.3711728Z 2025-05-07T19:46:16.3711733Z 2025-05-07T19:46:16.3711736Z 2025-05-07T19:46:16.3711740Z 2025-05-07T19:46:16.3711743Z 2025-05-07T19:46:16.3711747Z 2025-05-07T19:46:16.3711750Z 2025-05-07T19:46:16.3711754Z 2025-05-07T19:46:16.3711758Z 
2025-05-07T19:46:16.3711761Z 2025-05-07T19:46:16.3711765Z 2025-05-07T19:46:16.3711768Z 2025-05-07T19:46:16.3711771Z 2025-05-07T19:46:16.3711775Z 2025-05-07T19:46:16.3711786Z 2025-05-07T19:46:16.3858077Z cuda-nvvm-tools-12.6 | 10.4 MB | | 0%  2025-05-07T19:46:16.3858506Z 2025-05-07T19:46:16.3858699Z 2025-05-07T19:46:16.3858702Z 2025-05-07T19:46:16.3858706Z 2025-05-07T19:46:16.3858710Z 2025-05-07T19:46:16.3858713Z 2025-05-07T19:46:16.3858788Z 2025-05-07T19:46:16.3858797Z 2025-05-07T19:46:16.3858802Z 2025-05-07T19:46:16.3858805Z 2025-05-07T19:46:16.3858810Z 2025-05-07T19:46:16.3858814Z 2025-05-07T19:46:16.3858819Z 2025-05-07T19:46:16.3858830Z 2025-05-07T19:46:16.3908764Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:16.3909373Z 2025-05-07T19:46:16.3909377Z 2025-05-07T19:46:16.3909381Z 2025-05-07T19:46:16.3909385Z 2025-05-07T19:46:16.3909388Z 2025-05-07T19:46:16.3909392Z 2025-05-07T19:46:16.3909395Z 2025-05-07T19:46:16.3909399Z 2025-05-07T19:46:16.3909402Z 2025-05-07T19:46:16.3909406Z 2025-05-07T19:46:16.3909425Z 2025-05-07T19:46:16.3909546Z 2025-05-07T19:46:16.3909550Z 2025-05-07T19:46:16.3909554Z 2025-05-07T19:46:16.3909557Z 2025-05-07T19:46:16.3909560Z 2025-05-07T19:46:16.4369465Z cuda-sanitizer-api-1 | 8.9 MB | | 0%  2025-05-07T19:46:16.4369927Z 2025-05-07T19:46:16.4370061Z 2025-05-07T19:46:16.4370065Z 2025-05-07T19:46:16.4370068Z 2025-05-07T19:46:16.4370118Z 2025-05-07T19:46:16.4370122Z 2025-05-07T19:46:16.4370222Z 2025-05-07T19:46:16.4370230Z 2025-05-07T19:46:16.4370235Z 2025-05-07T19:46:16.4370242Z 2025-05-07T19:46:16.4370264Z 2025-05-07T19:46:16.4370299Z 2025-05-07T19:46:16.4370304Z 2025-05-07T19:46:16.4370308Z 2025-05-07T19:46:16.4370312Z 2025-05-07T19:46:16.4370316Z 2025-05-07T19:46:16.4370321Z 2025-05-07T19:46:16.4670813Z cuda-nvvm-impl-12.6. 
| 7.7 MB | | 0%  2025-05-07T19:46:16.4671183Z 2025-05-07T19:46:16.4671201Z 2025-05-07T19:46:16.4671205Z 2025-05-07T19:46:16.4671222Z 2025-05-07T19:46:16.4671226Z 2025-05-07T19:46:16.4671229Z 2025-05-07T19:46:16.4671233Z 2025-05-07T19:46:16.4671236Z 2025-05-07T19:46:16.4671240Z 2025-05-07T19:46:16.4671243Z 2025-05-07T19:46:16.4671247Z 2025-05-07T19:46:16.4671251Z 2025-05-07T19:46:16.4671254Z 2025-05-07T19:46:16.4672532Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:16.4672871Z 2025-05-07T19:46:16.4672875Z 2025-05-07T19:46:16.4672895Z 2025-05-07T19:46:16.4672898Z 2025-05-07T19:46:16.4672902Z 2025-05-07T19:46:16.4672905Z 2025-05-07T19:46:16.4672918Z 2025-05-07T19:46:16.4672921Z 2025-05-07T19:46:16.4672924Z 2025-05-07T19:46:16.4672928Z 2025-05-07T19:46:16.4672931Z 2025-05-07T19:46:16.4672935Z 2025-05-07T19:46:16.4672938Z 2025-05-07T19:46:16.4715326Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:16.4715713Z 2025-05-07T19:46:16.4715717Z 2025-05-07T19:46:16.4715721Z 2025-05-07T19:46:16.4715737Z 2025-05-07T19:46:16.4715741Z 2025-05-07T19:46:16.4715744Z 2025-05-07T19:46:16.4715748Z 2025-05-07T19:46:16.4715751Z 2025-05-07T19:46:16.4715755Z 2025-05-07T19:46:16.4715784Z 2025-05-07T19:46:16.4715788Z 2025-05-07T19:46:16.4715791Z 2025-05-07T19:46:16.4715794Z 2025-05-07T19:46:16.4715798Z 2025-05-07T19:46:16.4715801Z 2025-05-07T19:46:16.4825907Z cuda-nvvm-tools-12.6 | 10.4 MB | #######2 | 72%  2025-05-07T19:46:16.4826281Z 2025-05-07T19:46:16.4826286Z 2025-05-07T19:46:16.4826312Z 2025-05-07T19:46:16.4826316Z 2025-05-07T19:46:16.4826332Z 2025-05-07T19:46:16.4826335Z 2025-05-07T19:46:16.4826339Z 2025-05-07T19:46:16.4826343Z 2025-05-07T19:46:16.4910797Z cuda-nvdisasm-12.6.7 | 47.6 MB | ########## | 100%  2025-05-07T19:46:16.4911199Z 2025-05-07T19:46:16.4911204Z 2025-05-07T19:46:16.4911232Z 2025-05-07T19:46:16.4911236Z 2025-05-07T19:46:16.4911239Z 2025-05-07T19:46:16.4911243Z 2025-05-07T19:46:16.4911258Z 2025-05-07T19:46:16.4911262Z 
2025-05-07T19:46:16.4911265Z 2025-05-07T19:46:16.4911269Z 2025-05-07T19:46:16.4911272Z 2025-05-07T19:46:16.4911276Z 2025-05-07T19:46:16.4911279Z 2025-05-07T19:46:16.4911283Z 2025-05-07T19:46:16.4911287Z 2025-05-07T19:46:16.4911290Z 2025-05-07T19:46:16.5049653Z cuda-sanitizer-api-1 | 8.9 MB | ####### | 71%  2025-05-07T19:46:16.5050780Z 2025-05-07T19:46:16.5050794Z 2025-05-07T19:46:16.5050805Z 2025-05-07T19:46:16.5050815Z 2025-05-07T19:46:16.5050826Z 2025-05-07T19:46:16.5105383Z cuda-nvvp-12.6.80 | 109.3 MB | ########## | 100%  2025-05-07T19:46:16.5106750Z 2025-05-07T19:46:16.5106763Z 2025-05-07T19:46:16.5106774Z 2025-05-07T19:46:16.5106785Z 2025-05-07T19:46:16.5106795Z 2025-05-07T19:46:16.5106806Z 2025-05-07T19:46:16.5106816Z 2025-05-07T19:46:16.5106827Z 2025-05-07T19:46:16.5106838Z 2025-05-07T19:46:16.5106848Z 2025-05-07T19:46:16.5106859Z 2025-05-07T19:46:16.5106869Z 2025-05-07T19:46:16.5107082Z 2025-05-07T19:46:16.5107095Z 2025-05-07T19:46:16.5107105Z 2025-05-07T19:46:16.5107116Z 2025-05-07T19:46:16.5107127Z 2025-05-07T19:46:16.5107137Z 2025-05-07T19:46:16.5369791Z cuda-cupti-dev-12.6. | 3.4 MB | | 0%  2025-05-07T19:46:16.5370168Z 2025-05-07T19:46:16.5370377Z 2025-05-07T19:46:16.5370385Z 2025-05-07T19:46:16.5370391Z 2025-05-07T19:46:16.5370397Z 2025-05-07T19:46:16.5370401Z 2025-05-07T19:46:16.5370405Z 2025-05-07T19:46:16.5370410Z 2025-05-07T19:46:16.5370414Z 2025-05-07T19:46:16.5370439Z 2025-05-07T19:46:16.5370456Z 2025-05-07T19:46:16.5370460Z 2025-05-07T19:46:16.5370465Z 2025-05-07T19:46:16.5370501Z 2025-05-07T19:46:16.5370507Z 2025-05-07T19:46:16.5370511Z 2025-05-07T19:46:16.5370515Z 2025-05-07T19:46:16.6005447Z cuda-nvvm-impl-12.6. 
| 7.7 MB | ######1 | 62%  2025-05-07T19:46:16.6005896Z 2025-05-07T19:46:16.6005900Z 2025-05-07T19:46:16.6005938Z 2025-05-07T19:46:16.6005942Z 2025-05-07T19:46:16.6005946Z 2025-05-07T19:46:16.6005950Z 2025-05-07T19:46:16.6005953Z 2025-05-07T19:46:16.6005958Z 2025-05-07T19:46:16.6005962Z 2025-05-07T19:46:16.6005967Z 2025-05-07T19:46:16.6005971Z 2025-05-07T19:46:16.6005976Z 2025-05-07T19:46:16.6005981Z 2025-05-07T19:46:16.6005985Z 2025-05-07T19:46:16.6005990Z 2025-05-07T19:46:16.6005994Z 2025-05-07T19:46:16.6006000Z 2025-05-07T19:46:16.6006019Z 2025-05-07T19:46:16.6364181Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:16.6364661Z 2025-05-07T19:46:16.6364667Z 2025-05-07T19:46:16.6364671Z 2025-05-07T19:46:16.6364675Z 2025-05-07T19:46:16.6364679Z 2025-05-07T19:46:16.6364683Z 2025-05-07T19:46:16.6364703Z 2025-05-07T19:46:16.6364706Z 2025-05-07T19:46:16.6364709Z 2025-05-07T19:46:16.6364713Z 2025-05-07T19:46:16.6364716Z 2025-05-07T19:46:16.6364720Z 2025-05-07T19:46:16.6364723Z 2025-05-07T19:46:16.6364728Z 2025-05-07T19:46:16.6364787Z 2025-05-07T19:46:16.6364791Z 2025-05-07T19:46:16.6422406Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:16.6422802Z 2025-05-07T19:46:16.6422807Z 2025-05-07T19:46:16.6422810Z 2025-05-07T19:46:16.6422814Z 2025-05-07T19:46:16.6422817Z 2025-05-07T19:46:16.6422821Z 2025-05-07T19:46:16.6422825Z 2025-05-07T19:46:16.6422829Z 2025-05-07T19:46:16.6422834Z 2025-05-07T19:46:16.6422838Z 2025-05-07T19:46:16.6422841Z 2025-05-07T19:46:16.6422845Z 2025-05-07T19:46:16.6422848Z 2025-05-07T19:46:16.6422870Z 2025-05-07T19:46:16.6422873Z 2025-05-07T19:46:16.6452324Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:16.6452695Z 2025-05-07T19:46:16.6452700Z 2025-05-07T19:46:16.6452704Z 2025-05-07T19:46:16.6452707Z 2025-05-07T19:46:16.6452711Z 2025-05-07T19:46:16.6452714Z 2025-05-07T19:46:16.6452718Z 2025-05-07T19:46:16.6452721Z 2025-05-07T19:46:16.6453060Z 2025-05-07T19:46:16.6453196Z 
2025-05-07T19:46:16.6453210Z 2025-05-07T19:46:16.6453223Z 2025-05-07T19:46:16.6453235Z 2025-05-07T19:46:16.6453248Z 2025-05-07T19:46:16.6453260Z 2025-05-07T19:46:16.6453272Z 2025-05-07T19:46:16.6453285Z 2025-05-07T19:46:16.6454806Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:16.6455883Z 2025-05-07T19:46:16.6455894Z 2025-05-07T19:46:16.6455905Z 2025-05-07T19:46:16.6455915Z 2025-05-07T19:46:16.6455925Z 2025-05-07T19:46:16.6455935Z 2025-05-07T19:46:16.6456339Z 2025-05-07T19:46:16.6456350Z 2025-05-07T19:46:16.6456361Z 2025-05-07T19:46:16.6456371Z 2025-05-07T19:46:16.6456381Z 2025-05-07T19:46:16.6456391Z 2025-05-07T19:46:16.6456402Z 2025-05-07T19:46:16.6456413Z 2025-05-07T19:46:16.6456423Z 2025-05-07T19:46:16.6456433Z 2025-05-07T19:46:16.6456444Z 2025-05-07T19:46:16.6543396Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:16.6543799Z 2025-05-07T19:46:16.6543804Z 2025-05-07T19:46:16.6543808Z 2025-05-07T19:46:16.6543811Z 2025-05-07T19:46:16.6543815Z 2025-05-07T19:46:16.6543819Z 2025-05-07T19:46:16.6543822Z 2025-05-07T19:46:16.6543825Z 2025-05-07T19:46:16.6543829Z 2025-05-07T19:46:16.6543832Z 2025-05-07T19:46:16.6543836Z 2025-05-07T19:46:16.6543839Z 2025-05-07T19:46:16.6543843Z 2025-05-07T19:46:16.6543873Z 2025-05-07T19:46:16.6543877Z 2025-05-07T19:46:16.6543880Z 2025-05-07T19:46:16.6543883Z 2025-05-07T19:46:16.6543887Z 2025-05-07T19:46:16.6543891Z 2025-05-07T19:46:16.7086419Z ... (more hidden) ... 
2025-05-07T19:46:16.7086758Z 2025-05-07T19:46:16.7086783Z 2025-05-07T19:46:16.7086787Z 2025-05-07T19:46:16.7086790Z 2025-05-07T19:46:16.7086794Z 2025-05-07T19:46:16.7086798Z 2025-05-07T19:46:16.7086802Z 2025-05-07T19:46:16.7086806Z 2025-05-07T19:46:16.7086810Z 2025-05-07T19:46:16.7086813Z 2025-05-07T19:46:16.7086817Z 2025-05-07T19:46:16.7086834Z 2025-05-07T19:46:16.7086837Z 2025-05-07T19:46:16.7086841Z 2025-05-07T19:46:16.7086844Z 2025-05-07T19:46:16.7086848Z 2025-05-07T19:46:16.7086851Z 2025-05-07T19:46:16.7086855Z 2025-05-07T19:46:16.7086858Z 2025-05-07T19:46:16.7397170Z ... (more hidden) ... 2025-05-07T19:46:16.7397820Z 2025-05-07T19:46:16.7397900Z 2025-05-07T19:46:16.7397905Z 2025-05-07T19:46:16.7397909Z 2025-05-07T19:46:16.7397913Z 2025-05-07T19:46:16.7397916Z 2025-05-07T19:46:16.9447399Z libcusolver-11.7.1.2 | 95.8 MB | ########## | 100%  2025-05-07T19:46:16.9447818Z 2025-05-07T19:46:16.9447822Z 2025-05-07T19:46:16.9447826Z 2025-05-07T19:46:16.9447830Z 2025-05-07T19:46:16.9447833Z 2025-05-07T19:46:16.9447838Z 2025-05-07T19:46:16.9447841Z 2025-05-07T19:46:16.9447846Z 2025-05-07T19:46:16.9447851Z 2025-05-07T19:46:17.0655339Z libcurand-10.3.7.77 | 39.9 MB | ########## | 100%  2025-05-07T19:46:17.0656347Z 2025-05-07T19:46:17.0656362Z 2025-05-07T19:46:17.0656374Z 2025-05-07T19:46:17.0656387Z 2025-05-07T19:46:17.0656400Z 2025-05-07T19:46:17.0656412Z 2025-05-07T19:46:17.0656424Z 2025-05-07T19:46:17.0656436Z 2025-05-07T19:46:17.0656448Z 2025-05-07T19:46:17.0656459Z 2025-05-07T19:46:17.0656471Z 2025-05-07T19:46:17.0656482Z 2025-05-07T19:46:17.2819313Z cuda-nvrtc-12.6.85 | 17.3 MB | ########## | 100%  2025-05-07T19:46:17.2819681Z 2025-05-07T19:46:17.2819717Z 2025-05-07T19:46:17.2819722Z 2025-05-07T19:46:17.2819727Z 2025-05-07T19:46:17.2819769Z 2025-05-07T19:46:17.2819773Z 2025-05-07T19:46:17.2819777Z 2025-05-07T19:46:17.2819781Z 2025-05-07T19:46:17.2819784Z 2025-05-07T19:46:17.2819788Z 2025-05-07T19:46:17.3651690Z gds-tools-1.11.1.6 | 37.8 MB | ########## | 100%  
2025-05-07T19:46:17.3652067Z 2025-05-07T19:46:17.3652072Z 2025-05-07T19:46:17.3652075Z 2025-05-07T19:46:17.3652080Z 2025-05-07T19:46:17.3652084Z 2025-05-07T19:46:17.3652110Z 2025-05-07T19:46:17.3652114Z 2025-05-07T19:46:17.3652119Z 2025-05-07T19:46:17.3652122Z 2025-05-07T19:46:17.3652126Z 2025-05-07T19:46:17.3652129Z 2025-05-07T19:46:17.5631054Z cuda-nvcc-tools-12.6 | 23.0 MB | ########## | 100%  2025-05-07T19:46:17.5631427Z 2025-05-07T19:46:17.5631508Z 2025-05-07T19:46:17.5631512Z 2025-05-07T19:46:17.5631543Z 2025-05-07T19:46:17.5631646Z 2025-05-07T19:46:17.5631681Z 2025-05-07T19:46:17.5631722Z 2025-05-07T19:46:17.5631727Z 2025-05-07T19:46:17.5631731Z 2025-05-07T19:46:17.5631735Z 2025-05-07T19:46:17.5632051Z 2025-05-07T19:46:17.5632055Z 2025-05-07T19:46:17.5632059Z 2025-05-07T19:46:17.5632063Z 2025-05-07T19:46:17.7888411Z cuda-nvcc-dev_linux- | 10.8 MB | ########## | 100%  2025-05-07T19:46:17.7888984Z 2025-05-07T19:46:17.7889019Z 2025-05-07T19:46:17.7889024Z 2025-05-07T19:46:17.7889028Z 2025-05-07T19:46:17.7889034Z 2025-05-07T19:46:17.7889039Z 2025-05-07T19:46:17.7889313Z 2025-05-07T19:46:17.7889319Z 2025-05-07T19:46:17.7889324Z 2025-05-07T19:46:17.7889331Z 2025-05-07T19:46:17.7889338Z 2025-05-07T19:46:17.7889343Z 2025-05-07T19:46:17.7889349Z 2025-05-07T19:46:17.9223434Z libnvjitlink-12.6.85 | 14.9 MB | ########## | 100%  2025-05-07T19:46:17.9224026Z 2025-05-07T19:46:17.9224035Z 2025-05-07T19:46:17.9224047Z 2025-05-07T19:46:17.9224054Z 2025-05-07T19:46:17.9224061Z 2025-05-07T19:46:17.9224068Z 2025-05-07T19:46:17.9224074Z 2025-05-07T19:46:17.9224080Z 2025-05-07T19:46:17.9224139Z 2025-05-07T19:46:17.9224146Z 2025-05-07T19:46:17.9224157Z 2025-05-07T19:46:17.9224164Z 2025-05-07T19:46:17.9224171Z 2025-05-07T19:46:17.9224178Z 2025-05-07T19:46:17.9224186Z 2025-05-07T19:46:17.9224193Z 2025-05-07T19:46:17.9224200Z 2025-05-07T19:46:17.9224205Z 2025-05-07T19:46:17.9224772Z cuda-cupti-dev-12.6. 
| 3.4 MB | ########## | 100%  2025-05-07T19:46:17.9225315Z 2025-05-07T19:46:17.9225322Z 2025-05-07T19:46:17.9225327Z 2025-05-07T19:46:17.9225332Z 2025-05-07T19:46:17.9225336Z 2025-05-07T19:46:17.9225344Z 2025-05-07T19:46:17.9225350Z 2025-05-07T19:46:17.9225357Z 2025-05-07T19:46:17.9225363Z 2025-05-07T19:46:17.9225370Z 2025-05-07T19:46:17.9225376Z 2025-05-07T19:46:17.9225382Z 2025-05-07T19:46:17.9225388Z 2025-05-07T19:46:17.9225394Z 2025-05-07T19:46:17.9225398Z 2025-05-07T19:46:17.9225403Z 2025-05-07T19:46:17.9225407Z 2025-05-07T19:46:17.9225420Z 2025-05-07T19:46:18.1137140Z cuda-cupti-dev-12.6. | 3.4 MB | ########## | 100%  2025-05-07T19:46:18.1137768Z 2025-05-07T19:46:18.1137776Z 2025-05-07T19:46:18.1137783Z 2025-05-07T19:46:18.1137789Z 2025-05-07T19:46:18.1137794Z 2025-05-07T19:46:18.1137799Z 2025-05-07T19:46:18.1137803Z 2025-05-07T19:46:18.1137809Z 2025-05-07T19:46:18.1137816Z 2025-05-07T19:46:18.1137822Z 2025-05-07T19:46:18.1137831Z 2025-05-07T19:46:18.1137861Z 2025-05-07T19:46:18.1137881Z 2025-05-07T19:46:18.1137887Z 2025-05-07T19:46:18.1137894Z 2025-05-07T19:46:18.1137899Z 2025-05-07T19:46:18.2874175Z cuda-sanitizer-api-1 | 8.9 MB | ########## | 100%  2025-05-07T19:46:18.2874704Z 2025-05-07T19:46:18.2874733Z 2025-05-07T19:46:18.2874777Z 2025-05-07T19:46:18.2874785Z 2025-05-07T19:46:18.2874791Z 2025-05-07T19:46:18.2874798Z 2025-05-07T19:46:18.2874806Z 2025-05-07T19:46:18.2874814Z 2025-05-07T19:46:18.2874820Z 2025-05-07T19:46:18.2874829Z 2025-05-07T19:46:18.2874834Z 2025-05-07T19:46:18.2874874Z 2025-05-07T19:46:18.2874880Z 2025-05-07T19:46:18.2874888Z 2025-05-07T19:46:18.2874893Z 2025-05-07T19:46:18.3544460Z cuda-nvvm-tools-12.6 | 10.4 MB | ########## | 100%  2025-05-07T19:46:18.3544842Z 2025-05-07T19:46:18.3544847Z 2025-05-07T19:46:18.3544851Z 2025-05-07T19:46:18.3544854Z 2025-05-07T19:46:18.3544858Z 2025-05-07T19:46:18.3544862Z 2025-05-07T19:46:18.3544883Z 2025-05-07T19:46:18.4092385Z libnpp-12.3.1.54 | 93.4 MB | ########## | 100%  
2025-05-07T19:46:18.4093049Z 2025-05-07T19:46:18.4093149Z 2025-05-07T19:46:18.4093155Z 2025-05-07T19:46:18.4093159Z 2025-05-07T19:46:18.4093162Z 2025-05-07T19:46:18.4093166Z 2025-05-07T19:46:18.4093173Z 2025-05-07T19:46:18.4093258Z 2025-05-07T19:46:18.4093264Z 2025-05-07T19:46:18.4093269Z 2025-05-07T19:46:18.4093276Z 2025-05-07T19:46:18.4093283Z 2025-05-07T19:46:18.4093288Z 2025-05-07T19:46:18.4093292Z 2025-05-07T19:46:18.4093297Z 2025-05-07T19:46:18.4093302Z 2025-05-07T19:46:18.4093535Z 2025-05-07T19:46:18.4093572Z 2025-05-07T19:46:18.4093576Z 2025-05-07T19:46:18.4094033Z ... (more hidden) ... 2025-05-07T19:46:18.4094361Z 2025-05-07T19:46:18.4094365Z 2025-05-07T19:46:18.4094368Z 2025-05-07T19:46:18.4094371Z 2025-05-07T19:46:18.4094375Z 2025-05-07T19:46:18.4094378Z 2025-05-07T19:46:18.4094397Z 2025-05-07T19:46:18.4095061Z 2025-05-07T19:46:18.4095066Z 2025-05-07T19:46:18.4095069Z 2025-05-07T19:46:18.4095073Z 2025-05-07T19:46:18.4095076Z 2025-05-07T19:46:18.4095079Z 2025-05-07T19:46:18.4095083Z 2025-05-07T19:46:18.4095086Z 2025-05-07T19:46:18.4095089Z 2025-05-07T19:46:18.4095093Z 2025-05-07T19:46:18.4095096Z 2025-05-07T19:46:18.4095100Z 2025-05-07T19:46:18.4183426Z ... (more hidden) ... 2025-05-07T19:46:18.4183757Z 2025-05-07T19:46:18.4183762Z 2025-05-07T19:46:18.4183765Z 2025-05-07T19:46:18.4183769Z 2025-05-07T19:46:18.4183792Z 2025-05-07T19:46:18.4183796Z 2025-05-07T19:46:18.4183800Z 2025-05-07T19:46:18.4183803Z 2025-05-07T19:46:18.4183807Z 2025-05-07T19:46:18.4183810Z 2025-05-07T19:46:18.4183813Z 2025-05-07T19:46:18.4183817Z 2025-05-07T19:46:18.4183820Z 2025-05-07T19:46:18.4183824Z 2025-05-07T19:46:18.4183827Z 2025-05-07T19:46:18.4183831Z 2025-05-07T19:46:18.4183840Z 2025-05-07T19:46:18.4392368Z cuda-nvvm-impl-12.6. | 7.7 MB | ########## | 100%  2025-05-07T19:46:19.8795000Z nsight-compute-2024. 
| 443.1 MB | ########## | 100% 2025-05-07T19:46:19.8795373Z 2025-05-07T19:46:22.9104947Z libcublas-12.6.4.1 | 256.2 MB | ########## | 100%  2025-05-07T19:46:22.9118948Z nsight-compute-2024. | 443.1 MB | ########## | 100% 2025-05-07T19:46:22.9119310Z 2025-05-07T19:46:22.9119316Z 2025-05-07T19:46:22.9119321Z 2025-05-07T19:46:22.9119325Z 2025-05-07T19:46:22.9119330Z 2025-05-07T19:46:22.9119336Z 2025-05-07T19:46:22.9119344Z 2025-05-07T19:46:22.9119349Z 2025-05-07T19:46:22.9119382Z 2025-05-07T19:46:22.9119387Z 2025-05-07T19:46:22.9119390Z 2025-05-07T19:46:22.9119410Z 2025-05-07T19:46:22.9119414Z 2025-05-07T19:46:22.9119417Z 2025-05-07T19:46:22.9119420Z 2025-05-07T19:46:22.9119424Z 2025-05-07T19:46:22.9119427Z 2025-05-07T19:46:22.9119433Z 2025-05-07T19:46:22.9119436Z 2025-05-07T19:46:22.9119523Z 2025-05-07T19:46:22.9119955Z  2025-05-07T19:46:22.9120291Z 2025-05-07T19:46:22.9120521Z 2025-05-07T19:46:22.9120691Z  2025-05-07T19:46:22.9120917Z 2025-05-07T19:46:22.9120921Z 2025-05-07T19:46:22.9121090Z  2025-05-07T19:46:22.9121319Z 2025-05-07T19:46:22.9121324Z 2025-05-07T19:46:22.9121327Z 2025-05-07T19:46:22.9121501Z  2025-05-07T19:46:22.9121717Z 2025-05-07T19:46:22.9121743Z 2025-05-07T19:46:22.9121747Z 2025-05-07T19:46:22.9121750Z 2025-05-07T19:46:22.9121928Z  2025-05-07T19:46:22.9122146Z 2025-05-07T19:46:22.9122150Z 2025-05-07T19:46:22.9122153Z 2025-05-07T19:46:22.9122156Z 2025-05-07T19:46:22.9122245Z 2025-05-07T19:46:22.9122432Z  2025-05-07T19:46:22.9122676Z 2025-05-07T19:46:22.9122680Z 2025-05-07T19:46:22.9122683Z 2025-05-07T19:46:22.9122687Z 2025-05-07T19:46:22.9122690Z 2025-05-07T19:46:22.9122694Z 2025-05-07T19:46:22.9122878Z  2025-05-07T19:46:22.9123104Z 2025-05-07T19:46:22.9123108Z 2025-05-07T19:46:22.9123111Z 2025-05-07T19:46:22.9123130Z 2025-05-07T19:46:22.9123133Z 2025-05-07T19:46:22.9123137Z 2025-05-07T19:46:22.9123140Z 2025-05-07T19:46:22.9123325Z  2025-05-07T19:46:22.9123798Z 2025-05-07T19:46:22.9123801Z 2025-05-07T19:46:22.9123805Z 2025-05-07T19:46:22.9123809Z 
2025-05-07T19:46:22.9137188Z 2025-05-07T19:46:22.9137192Z 2025-05-07T19:46:22.9137328Z  2025-05-07T19:46:22.9137448Z 2025-05-07T19:46:22.9137452Z 2025-05-07T19:46:22.9137566Z  2025-05-07T19:46:22.9137709Z 2025-05-07T19:46:22.9137713Z 2025-05-07T19:46:22.9137717Z 2025-05-07T19:46:22.9137832Z  2025-05-07T19:46:22.9137955Z 2025-05-07T19:46:22.9137958Z 2025-05-07T19:46:22.9137962Z 2025-05-07T19:46:22.9137965Z 2025-05-07T19:46:22.9138175Z  2025-05-07T19:46:22.9138306Z 2025-05-07T19:46:22.9138310Z 2025-05-07T19:46:22.9138314Z 2025-05-07T19:46:22.9138317Z 2025-05-07T19:46:22.9138320Z 2025-05-07T19:46:22.9138441Z  2025-05-07T19:46:22.9138603Z 2025-05-07T19:46:22.9138606Z 2025-05-07T19:46:22.9138610Z 2025-05-07T19:46:22.9138613Z 2025-05-07T19:46:22.9138617Z 2025-05-07T19:46:22.9138620Z 2025-05-07T19:46:22.9138817Z  2025-05-07T19:46:22.9138962Z 2025-05-07T19:46:22.9138966Z 2025-05-07T19:46:22.9138969Z 2025-05-07T19:46:22.9139003Z 2025-05-07T19:46:22.9139006Z 2025-05-07T19:46:22.9139010Z 2025-05-07T19:46:22.9139013Z 2025-05-07T19:46:22.9139142Z  2025-05-07T19:46:22.9139298Z 2025-05-07T19:46:22.9139302Z 2025-05-07T19:46:22.9139305Z 2025-05-07T19:46:22.9139309Z 2025-05-07T19:46:22.9139313Z 2025-05-07T19:46:22.9139316Z 2025-05-07T19:46:22.9139319Z 2025-05-07T19:46:22.9139349Z 2025-05-07T19:46:22.9139642Z  2025-05-07T19:46:22.9139810Z 2025-05-07T19:46:22.9139814Z 2025-05-07T19:46:22.9139818Z 2025-05-07T19:46:22.9139821Z 2025-05-07T19:46:22.9139825Z 2025-05-07T19:46:22.9139828Z 2025-05-07T19:46:22.9139832Z 2025-05-07T19:46:22.9139835Z 2025-05-07T19:46:22.9139838Z 2025-05-07T19:46:22.9140002Z  2025-05-07T19:46:22.9140177Z 2025-05-07T19:46:22.9140181Z 2025-05-07T19:46:22.9140184Z 2025-05-07T19:46:22.9140191Z 2025-05-07T19:46:22.9140194Z 2025-05-07T19:46:22.9140198Z 2025-05-07T19:46:22.9140202Z 2025-05-07T19:46:22.9140205Z 2025-05-07T19:46:22.9140209Z 2025-05-07T19:46:22.9140212Z 2025-05-07T19:46:22.9140503Z  2025-05-07T19:46:22.9140686Z 2025-05-07T19:46:22.9140690Z 
2025-05-07T19:46:22.9140693Z 2025-05-07T19:46:22.9140696Z 2025-05-07T19:46:22.9140700Z 2025-05-07T19:46:22.9140703Z 2025-05-07T19:46:22.9140707Z 2025-05-07T19:46:22.9140710Z 2025-05-07T19:46:22.9140714Z 2025-05-07T19:46:22.9140717Z 2025-05-07T19:46:22.9140721Z 2025-05-07T19:46:22.9140900Z  2025-05-07T19:46:22.9141095Z 2025-05-07T19:46:22.9141098Z 2025-05-07T19:46:22.9141101Z 2025-05-07T19:46:22.9141105Z 2025-05-07T19:46:22.9141108Z 2025-05-07T19:46:22.9141112Z 2025-05-07T19:46:22.9141115Z 2025-05-07T19:46:22.9141119Z 2025-05-07T19:46:22.9141122Z 2025-05-07T19:46:22.9141126Z 2025-05-07T19:46:22.9141129Z 2025-05-07T19:46:22.9141160Z 2025-05-07T19:46:22.9141312Z  2025-05-07T19:46:22.9141518Z 2025-05-07T19:46:22.9141521Z 2025-05-07T19:46:22.9141525Z 2025-05-07T19:46:22.9141528Z 2025-05-07T19:46:22.9141531Z 2025-05-07T19:46:22.9141535Z 2025-05-07T19:46:22.9141539Z 2025-05-07T19:46:22.9141542Z 2025-05-07T19:46:22.9141545Z 2025-05-07T19:46:22.9141549Z 2025-05-07T19:46:22.9141581Z 2025-05-07T19:46:22.9141585Z 2025-05-07T19:46:22.9141588Z 2025-05-07T19:46:22.9141741Z  2025-05-07T19:46:22.9141953Z 2025-05-07T19:46:22.9141956Z 2025-05-07T19:46:22.9141960Z 2025-05-07T19:46:22.9141966Z 2025-05-07T19:46:22.9141969Z 2025-05-07T19:46:22.9141973Z 2025-05-07T19:46:22.9141976Z 2025-05-07T19:46:22.9142008Z 2025-05-07T19:46:22.9142011Z 2025-05-07T19:46:22.9142015Z 2025-05-07T19:46:22.9142018Z 2025-05-07T19:46:22.9142022Z 2025-05-07T19:46:22.9142025Z 2025-05-07T19:46:22.9142029Z 2025-05-07T19:46:22.9142188Z  2025-05-07T19:46:22.9142405Z 2025-05-07T19:46:22.9142411Z 2025-05-07T19:46:22.9142415Z 2025-05-07T19:46:22.9142418Z 2025-05-07T19:46:22.9142451Z 2025-05-07T19:46:22.9142454Z 2025-05-07T19:46:22.9142457Z 2025-05-07T19:46:22.9142461Z 2025-05-07T19:46:22.9142465Z 2025-05-07T19:46:22.9142468Z 2025-05-07T19:46:22.9142472Z 2025-05-07T19:46:22.9142475Z 2025-05-07T19:46:22.9142479Z 2025-05-07T19:46:22.9142482Z 2025-05-07T19:46:22.9142485Z 2025-05-07T19:46:22.9142649Z  
2025-05-07T19:46:22.9142901Z 2025-05-07T19:46:22.9142904Z 2025-05-07T19:46:22.9142908Z 2025-05-07T19:46:22.9142976Z 2025-05-07T19:46:22.9142980Z 2025-05-07T19:46:22.9142983Z 2025-05-07T19:46:22.9142987Z 2025-05-07T19:46:22.9142990Z 2025-05-07T19:46:22.9142993Z 2025-05-07T19:46:22.9142997Z 2025-05-07T19:46:22.9143000Z 2025-05-07T19:46:22.9143004Z 2025-05-07T19:46:22.9143007Z 2025-05-07T19:46:22.9143010Z 2025-05-07T19:46:22.9143014Z 2025-05-07T19:46:22.9143017Z 2025-05-07T19:46:22.9143240Z  2025-05-07T19:46:22.9143499Z 2025-05-07T19:46:22.9143503Z 2025-05-07T19:46:22.9143507Z 2025-05-07T19:46:22.9143510Z 2025-05-07T19:46:22.9143513Z 2025-05-07T19:46:22.9143517Z 2025-05-07T19:46:22.9143520Z 2025-05-07T19:46:22.9143524Z 2025-05-07T19:46:22.9143527Z 2025-05-07T19:46:22.9143531Z 2025-05-07T19:46:22.9143534Z 2025-05-07T19:46:22.9143537Z 2025-05-07T19:46:22.9143540Z 2025-05-07T19:46:22.9143544Z 2025-05-07T19:46:22.9143547Z 2025-05-07T19:46:22.9143551Z 2025-05-07T19:46:22.9143554Z 2025-05-07T19:46:22.9143758Z  2025-05-07T19:46:22.9143994Z 2025-05-07T19:46:22.9143998Z 2025-05-07T19:46:22.9144001Z 2025-05-07T19:46:22.9144005Z 2025-05-07T19:46:22.9144008Z 2025-05-07T19:46:22.9144012Z 2025-05-07T19:46:22.9144015Z 2025-05-07T19:46:22.9144018Z 2025-05-07T19:46:22.9144022Z 2025-05-07T19:46:22.9144026Z 2025-05-07T19:46:22.9144029Z 2025-05-07T19:46:22.9144032Z 2025-05-07T19:46:22.9144066Z 2025-05-07T19:46:22.9144073Z 2025-05-07T19:46:22.9144076Z 2025-05-07T19:46:22.9144079Z 2025-05-07T19:46:22.9144083Z 2025-05-07T19:46:22.9144086Z 2025-05-07T19:46:22.9144266Z  2025-05-07T19:46:22.9144503Z 2025-05-07T19:46:22.9144507Z 2025-05-07T19:46:22.9144649Z  2025-05-07T19:46:22.9144773Z 2025-05-07T19:46:22.9144776Z 2025-05-07T19:46:22.9144892Z  2025-05-07T19:46:22.9145044Z 2025-05-07T19:46:22.9145048Z 2025-05-07T19:46:22.9145051Z 2025-05-07T19:46:22.9145168Z  2025-05-07T19:46:22.9145296Z 2025-05-07T19:46:22.9145303Z 2025-05-07T19:46:22.9145306Z 2025-05-07T19:46:22.9145310Z 
2025-05-07T19:46:22.9145464Z  2025-05-07T19:46:22.9145597Z 2025-05-07T19:46:22.9145601Z 2025-05-07T19:46:22.9145605Z 2025-05-07T19:46:22.9145608Z 2025-05-07T19:46:22.9145612Z 2025-05-07T19:46:22.9145732Z  2025-05-07T19:46:22.9145903Z 2025-05-07T19:46:22.9145907Z 2025-05-07T19:46:22.9145910Z 2025-05-07T19:46:22.9145913Z 2025-05-07T19:46:22.9145920Z 2025-05-07T19:46:22.9145924Z 2025-05-07T19:46:22.9146039Z  2025-05-07T19:46:22.9146177Z 2025-05-07T19:46:22.9146214Z 2025-05-07T19:46:22.9146217Z 2025-05-07T19:46:22.9146220Z 2025-05-07T19:46:22.9146224Z 2025-05-07T19:46:22.9146227Z 2025-05-07T19:46:22.9146231Z 2025-05-07T19:46:22.9146357Z  2025-05-07T19:46:22.9146515Z 2025-05-07T19:46:22.9146519Z 2025-05-07T19:46:22.9146522Z 2025-05-07T19:46:22.9146526Z 2025-05-07T19:46:22.9146529Z 2025-05-07T19:46:22.9146560Z 2025-05-07T19:46:22.9146563Z 2025-05-07T19:46:22.9146569Z 2025-05-07T19:46:22.9146700Z  2025-05-07T19:46:22.9146870Z 2025-05-07T19:46:22.9146874Z 2025-05-07T19:46:22.9146877Z 2025-05-07T19:46:22.9146881Z 2025-05-07T19:46:22.9146884Z 2025-05-07T19:46:22.9146887Z 2025-05-07T19:46:22.9146891Z 2025-05-07T19:46:22.9146894Z 2025-05-07T19:46:22.9146927Z 2025-05-07T19:46:22.9147068Z  2025-05-07T19:46:22.9147246Z 2025-05-07T19:46:22.9147253Z 2025-05-07T19:46:22.9147256Z 2025-05-07T19:46:22.9147260Z 2025-05-07T19:46:22.9147263Z 2025-05-07T19:46:22.9147266Z 2025-05-07T19:46:22.9147270Z 2025-05-07T19:46:22.9147273Z 2025-05-07T19:46:22.9147277Z 2025-05-07T19:46:22.9147280Z 2025-05-07T19:46:22.9147460Z  2025-05-07T19:46:22.9147642Z 2025-05-07T19:46:22.9147646Z 2025-05-07T19:46:22.9147650Z 2025-05-07T19:46:22.9147653Z 2025-05-07T19:46:22.9147656Z 2025-05-07T19:46:22.9147660Z 2025-05-07T19:46:22.9147663Z 2025-05-07T19:46:22.9147666Z 2025-05-07T19:46:22.9147670Z 2025-05-07T19:46:22.9147728Z 2025-05-07T19:46:22.9147732Z 2025-05-07T19:46:22.9147913Z  2025-05-07T19:46:22.9148108Z 2025-05-07T19:46:22.9148112Z 2025-05-07T19:46:22.9148115Z 2025-05-07T19:46:22.9148119Z 
2025-05-07T19:46:22.9148122Z 2025-05-07T19:46:22.9148126Z 2025-05-07T19:46:22.9148130Z 2025-05-07T19:46:22.9148133Z 2025-05-07T19:46:22.9148136Z 2025-05-07T19:46:22.9148140Z 2025-05-07T19:46:22.9148198Z 2025-05-07T19:46:22.9148231Z 2025-05-07T19:46:22.9148386Z  2025-05-07T19:46:22.9148591Z 2025-05-07T19:46:22.9148595Z 2025-05-07T19:46:22.9148598Z 2025-05-07T19:46:22.9148602Z 2025-05-07T19:46:22.9148605Z 2025-05-07T19:46:22.9148608Z 2025-05-07T19:46:22.9148612Z 2025-05-07T19:46:22.9148615Z 2025-05-07T19:46:22.9148619Z 2025-05-07T19:46:22.9148623Z 2025-05-07T19:46:22.9148658Z 2025-05-07T19:46:22.9148662Z 2025-05-07T19:46:22.9148665Z 2025-05-07T19:46:22.9148817Z  2025-05-07T19:46:22.9149037Z 2025-05-07T19:46:22.9149041Z 2025-05-07T19:46:22.9149044Z 2025-05-07T19:46:22.9149047Z 2025-05-07T19:46:22.9149051Z 2025-05-07T19:46:22.9149054Z 2025-05-07T19:46:22.9149058Z 2025-05-07T19:46:22.9149092Z 2025-05-07T19:46:22.9149096Z 2025-05-07T19:46:22.9149099Z 2025-05-07T19:46:22.9149103Z 2025-05-07T19:46:22.9149106Z 2025-05-07T19:46:22.9149109Z 2025-05-07T19:46:22.9149113Z 2025-05-07T19:46:22.9149276Z  2025-05-07T19:46:22.9149496Z 2025-05-07T19:46:22.9149499Z 2025-05-07T19:46:22.9149503Z 2025-05-07T19:46:22.9149507Z 2025-05-07T19:46:22.9149543Z 2025-05-07T19:46:22.9149546Z 2025-05-07T19:46:22.9149550Z 2025-05-07T19:46:22.9149553Z 2025-05-07T19:46:22.9149556Z 2025-05-07T19:46:22.9149560Z 2025-05-07T19:46:22.9149563Z 2025-05-07T19:46:22.9149567Z 2025-05-07T19:46:22.9149570Z 2025-05-07T19:46:22.9149573Z 2025-05-07T19:46:22.9149577Z 2025-05-07T19:46:22.9149739Z  2025-05-07T19:46:22.9149999Z 2025-05-07T19:46:22.9150003Z 2025-05-07T19:46:22.9150006Z 2025-05-07T19:46:22.9150009Z 2025-05-07T19:46:22.9150013Z 2025-05-07T19:46:22.9150017Z 2025-05-07T19:46:22.9150020Z 2025-05-07T19:46:22.9150023Z 2025-05-07T19:46:22.9150027Z 2025-05-07T19:46:22.9150030Z 2025-05-07T19:46:22.9150034Z 2025-05-07T19:46:22.9150038Z 2025-05-07T19:46:22.9150041Z 2025-05-07T19:46:22.9150044Z 
2025-05-07T19:46:22.9150048Z 2025-05-07T19:46:22.9150055Z 2025-05-07T19:46:22.9150220Z  2025-05-07T19:46:22.9150470Z 2025-05-07T19:46:22.9150474Z 2025-05-07T19:46:22.9150477Z 2025-05-07T19:46:22.9150481Z 2025-05-07T19:46:22.9150484Z 2025-05-07T19:46:22.9150488Z 2025-05-07T19:46:22.9150491Z 2025-05-07T19:46:22.9150495Z 2025-05-07T19:46:22.9150498Z 2025-05-07T19:46:22.9150501Z 2025-05-07T19:46:22.9150505Z 2025-05-07T19:46:22.9150508Z 2025-05-07T19:46:22.9150512Z 2025-05-07T19:46:22.9150515Z 2025-05-07T19:46:22.9150518Z 2025-05-07T19:46:22.9150522Z 2025-05-07T19:46:22.9150529Z 2025-05-07T19:46:22.9150737Z  2025-05-07T19:46:22.9150971Z 2025-05-07T19:46:22.9150975Z 2025-05-07T19:46:22.9150978Z 2025-05-07T19:46:22.9150982Z 2025-05-07T19:46:22.9150985Z 2025-05-07T19:46:22.9150989Z 2025-05-07T19:46:22.9150992Z 2025-05-07T19:46:22.9150996Z 2025-05-07T19:46:22.9150999Z 2025-05-07T19:46:22.9151002Z 2025-05-07T19:46:22.9151006Z 2025-05-07T19:46:22.9151043Z 2025-05-07T19:46:22.9151047Z 2025-05-07T19:46:22.9151050Z 2025-05-07T19:46:22.9151054Z 2025-05-07T19:46:22.9151057Z 2025-05-07T19:46:22.9151061Z 2025-05-07T19:46:22.9151065Z 2025-05-07T19:46:22.9151242Z  2025-05-07T19:46:22.9151476Z 2025-05-07T19:46:22.9151480Z 2025-05-07T19:46:22.9151618Z  2025-05-07T19:46:22.9151738Z 2025-05-07T19:46:22.9151742Z 2025-05-07T19:46:22.9151852Z  2025-05-07T19:46:22.9151999Z 2025-05-07T19:46:22.9152003Z 2025-05-07T19:46:22.9152007Z 2025-05-07T19:46:22.9152173Z  2025-05-07T19:46:22.9152304Z 2025-05-07T19:46:22.9152308Z 2025-05-07T19:46:22.9152311Z 2025-05-07T19:46:22.9152315Z 2025-05-07T19:46:22.9152461Z  2025-05-07T19:46:22.9152593Z 2025-05-07T19:46:22.9152597Z 2025-05-07T19:46:22.9152601Z 2025-05-07T19:46:22.9152604Z 2025-05-07T19:46:22.9152607Z 2025-05-07T19:46:22.9152727Z  2025-05-07T19:46:22.9152895Z 2025-05-07T19:46:22.9152899Z 2025-05-07T19:46:22.9152954Z 2025-05-07T19:46:22.9152958Z 2025-05-07T19:46:22.9152962Z 2025-05-07T19:46:22.9152965Z 2025-05-07T19:46:22.9153088Z  
2025-05-07T19:46:22.9153231Z 2025-05-07T19:46:22.9153262Z 2025-05-07T19:46:22.9153265Z 2025-05-07T19:46:22.9153269Z 2025-05-07T19:46:22.9153272Z 2025-05-07T19:46:22.9153276Z 2025-05-07T19:46:22.9153279Z 2025-05-07T19:46:22.9153407Z  2025-05-07T19:46:22.9153550Z 2025-05-07T19:46:22.9153553Z 2025-05-07T19:46:22.9153557Z 2025-05-07T19:46:22.9153560Z 2025-05-07T19:46:22.9153564Z 2025-05-07T19:46:22.9153597Z 2025-05-07T19:46:22.9153600Z 2025-05-07T19:46:22.9153604Z 2025-05-07T19:46:22.9153735Z  2025-05-07T19:46:22.9153898Z 2025-05-07T19:46:22.9153901Z 2025-05-07T19:46:22.9153905Z 2025-05-07T19:46:22.9153908Z 2025-05-07T19:46:22.9153912Z 2025-05-07T19:46:22.9153915Z 2025-05-07T19:46:22.9153919Z 2025-05-07T19:46:22.9153922Z 2025-05-07T19:46:22.9153950Z 2025-05-07T19:46:22.9154087Z  2025-05-07T19:46:22.9154262Z 2025-05-07T19:46:22.9154266Z 2025-05-07T19:46:22.9154270Z 2025-05-07T19:46:22.9154273Z 2025-05-07T19:46:22.9154276Z 2025-05-07T19:46:22.9154280Z 2025-05-07T19:46:22.9154283Z 2025-05-07T19:46:22.9154286Z 2025-05-07T19:46:22.9154290Z 2025-05-07T19:46:22.9154293Z 2025-05-07T19:46:22.9154459Z  2025-05-07T19:46:22.9154639Z 2025-05-07T19:46:22.9154642Z 2025-05-07T19:46:22.9154646Z 2025-05-07T19:46:22.9154649Z 2025-05-07T19:46:22.9154653Z 2025-05-07T19:46:22.9154656Z 2025-05-07T19:46:22.9154664Z 2025-05-07T19:46:22.9154667Z 2025-05-07T19:46:22.9154671Z 2025-05-07T19:46:22.9154674Z 2025-05-07T19:46:22.9154739Z 2025-05-07T19:46:22.9154882Z  2025-05-07T19:46:22.9155098Z 2025-05-07T19:46:22.9155102Z 2025-05-07T19:46:22.9155105Z 2025-05-07T19:46:22.9155109Z 2025-05-07T19:46:22.9155112Z 2025-05-07T19:46:22.9155115Z 2025-05-07T19:46:22.9155119Z 2025-05-07T19:46:22.9155122Z 2025-05-07T19:46:22.9155129Z 2025-05-07T19:46:22.9155133Z 2025-05-07T19:46:22.9155136Z 2025-05-07T19:46:22.9155140Z 2025-05-07T19:46:22.9155290Z  2025-05-07T19:46:22.9155513Z 2025-05-07T19:46:22.9155517Z 2025-05-07T19:46:22.9155520Z 2025-05-07T19:46:22.9155524Z 2025-05-07T19:46:22.9155527Z 
2025-05-07T19:46:22.9155531Z 2025-05-07T19:46:22.9155534Z 2025-05-07T19:46:22.9155538Z 2025-05-07T19:46:22.9155541Z 2025-05-07T19:46:22.9155544Z 2025-05-07T19:46:22.9155548Z 2025-05-07T19:46:22.9155551Z 2025-05-07T19:46:22.9155555Z 2025-05-07T19:46:22.9155708Z  2025-05-07T19:46:22.9155927Z 2025-05-07T19:46:22.9155930Z 2025-05-07T19:46:22.9155934Z 2025-05-07T19:46:22.9155937Z 2025-05-07T19:46:22.9155941Z 2025-05-07T19:46:22.9155944Z 2025-05-07T19:46:22.9155948Z 2025-05-07T19:46:22.9155951Z 2025-05-07T19:46:22.9155955Z 2025-05-07T19:46:22.9155958Z 2025-05-07T19:46:22.9155961Z 2025-05-07T19:46:22.9155965Z 2025-05-07T19:46:22.9155971Z 2025-05-07T19:46:22.9155975Z 2025-05-07T19:46:22.9156157Z  2025-05-07T19:46:22.9156368Z 2025-05-07T19:46:22.9156372Z 2025-05-07T19:46:22.9156375Z 2025-05-07T19:46:22.9156379Z 2025-05-07T19:46:22.9156382Z 2025-05-07T19:46:22.9156386Z 2025-05-07T19:46:22.9156389Z 2025-05-07T19:46:22.9156393Z 2025-05-07T19:46:22.9156396Z 2025-05-07T19:46:22.9156399Z 2025-05-07T19:46:22.9156403Z 2025-05-07T19:46:22.9156406Z 2025-05-07T19:46:22.9156410Z 2025-05-07T19:46:22.9156413Z 2025-05-07T19:46:22.9156416Z 2025-05-07T19:46:22.9156603Z  2025-05-07T19:46:22.9156891Z 2025-05-07T19:46:22.9156894Z 2025-05-07T19:46:22.9156898Z 2025-05-07T19:46:22.9156901Z 2025-05-07T19:46:22.9156905Z 2025-05-07T19:46:22.9156908Z 2025-05-07T19:46:22.9156912Z 2025-05-07T19:46:22.9156915Z 2025-05-07T19:46:22.9156919Z 2025-05-07T19:46:22.9156922Z 2025-05-07T19:46:22.9156953Z 2025-05-07T19:46:22.9156956Z 2025-05-07T19:46:22.9156960Z 2025-05-07T19:46:22.9157026Z 2025-05-07T19:46:22.9157030Z 2025-05-07T19:46:22.9157034Z 2025-05-07T19:46:22.9157204Z  2025-05-07T19:46:22.9157430Z 2025-05-07T19:46:22.9157434Z 2025-05-07T19:46:22.9157437Z 2025-05-07T19:46:22.9157441Z 2025-05-07T19:46:22.9157444Z 2025-05-07T19:46:22.9157474Z 2025-05-07T19:46:22.9157478Z 2025-05-07T19:46:22.9157481Z 2025-05-07T19:46:22.9157484Z 2025-05-07T19:46:22.9157488Z 2025-05-07T19:46:22.9157491Z 
2025-05-07T19:46:22.9160292Z done 2025-05-07T19:46:23.1209075Z Preparing transaction: / - done 2025-05-07T19:46:23.8238030Z Verifying transaction: | / - \ | / - done 2025-05-07T19:46:24.1289342Z Executing transaction: | / 
- done 2025-05-07T19:46:25.8737412Z [INSTALL] Fixing file placements for CUDA 12.6.3+ ... 2025-05-07T19:46:25.8737856Z [INSTALL] Creating symlinks: libnvToolsExt.so 2025-05-07T19:46:25.8738687Z + ln -sf /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:25.8739308Z 2025-05-07T19:46:25.8758333Z 2025-05-07T19:46:25.8760725Z + ln -sf /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so.1 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:25.8763115Z 2025-05-07T19:46:25.8777853Z 2025-05-07T19:46:25.8778189Z [INSTALL] Copying nvtx3 headers ... 2025-05-07T19:46:25.8782944Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/include/ 2025-05-07T19:46:25.8787398Z 2025-05-07T19:46:25.8900667Z 2025-05-07T19:46:25.8905559Z + cp -r /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCuda.h 
/github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtCudaRt.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtOpenCL.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvToolsExtSync.h /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtx3.hpp /github/home/miniconda/envs/build_binary/nsight-compute-2024.3.2/host/target-linux-x64/nvtx/include/nvtx3/nvtxDetail /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/ 2025-05-07T19:46:25.8909695Z 2025-05-07T19:46:25.8923678Z 2025-05-07T19:46:25.8924509Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:46:25.9355191Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs ... 2025-05-07T19:46:27.5745302Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs 2025-05-07T19:46:27.5746124Z 2025-05-07T19:46:27.9819409Z 2025-05-07T19:46:27.9824247Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:46:28.0197205Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:28.0197768Z 2025-05-07T19:46:28.4271282Z 2025-05-07T19:46:28.4271815Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 
2025-05-07T19:46:28.4272860Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:46:28.4273688Z 2025-05-07T19:46:28.8331619Z 2025-05-07T19:46:30.5380024Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/cuda_runtime.h 2025-05-07T19:46:32.2810188Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:46:33.9869535Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:46:33.9870466Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvToolsExt.so 2025-05-07T19:46:35.7222104Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libnvidia-ml.so 2025-05-07T19:46:37.3212647Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:46:37.3212967Z 2025-05-07T19:46:37.3985336Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:46:40.6381656Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:40.6383314Z Target: x86_64-conda-linux-gnu 2025-05-07T19:46:40.6383588Z Thread model: posix 2025-05-07T19:46:40.6383924Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:46:40.6384541Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:46:40.6385009Z 2025-05-07T19:46:40.6940376Z [INSTALL] Resetting compiler symlinks to clang ... 
2025-05-07T19:46:43.9545421Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:46:43.9565200Z 2025-05-07T19:46:43.9565207Z 2025-05-07T19:46:43.9583462Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:46:43.9583980Z 2025-05-07T19:46:43.9607624Z 2025-05-07T19:46:43.9633281Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:43.9633808Z 2025-05-07T19:46:43.9651401Z 2025-05-07T19:46:43.9677697Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:46:43.9693903Z 2025-05-07T19:46:43.9693910Z 2025-05-07T19:46:43.9694286Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:46:43.9694630Z 2025-05-07T19:46:43.9720905Z total 20 2025-05-07T19:46:43.9721547Z drwxr-xr-x. 2 root root 154 May 7 19:46 . 2025-05-07T19:46:43.9721909Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:46:43.9722339Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:46:43.9722803Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:46:43.9723229Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:46:43.9723639Z -rw-r--r--. 2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:46:43.9724069Z -rw-r--r--. 2 root root 2932 Nov 20 20:32 ~cuda-nvcc_activate.sh 2025-05-07T19:46:43.9724344Z 2025-05-07T19:46:43.9724585Z [INSTALL] Removing the -ccbin=CXX hook from NVCC activation scripts ... 
2025-05-07T19:46:43.9725252Z + sed -i /-ccbin=/d /github/home/miniconda/envs/build_binary/etc/conda/activate.d/*cuda-nvcc_activate.sh 2025-05-07T19:46:43.9725716Z 2025-05-07T19:46:43.9741475Z 2025-05-07T19:46:43.9741957Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:46:43.9742261Z 2025-05-07T19:46:45.6821183Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:46:45.6823201Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:46:45.6823671Z 2025-05-07T19:46:45.6823835Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:46:47.3478610Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:46:47.3480237Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:46:47.3481019Z 2025-05-07T19:46:47.7677603Z 2025-05-07T19:46:47.7678314Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:46:47.7678659Z 2025-05-07T19:46:49.3420881Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:46:49.3421475Z 2025-05-07T19:46:49.4000250Z 2025-05-07T19:46:49.4000980Z [INFO] Printing out all preprocessor defines in nvcc ... 
2025-05-07T19:46:49.4001536Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:46:49.4001898Z 2025-05-07T19:46:51.0866806Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:46:51.0867831Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:46:51.0868624Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:46:51.0869350Z #define ADJ_MICRO 0x1000 2025-05-07T19:46:51.0870103Z #define ADJ_NANO 0x2000 2025-05-07T19:46:51.0870790Z #define ADJ_OFFSET 0x0001 2025-05-07T19:46:51.0871604Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:46:51.0873013Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:46:51.0873433Z #define ADJ_STATUS 0x0010 2025-05-07T19:46:51.0873699Z #define ADJ_TAI 0x0080 2025-05-07T19:46:51.0873938Z #define ADJ_TICK 0x4000 2025-05-07T19:46:51.0874198Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:46:51.0874469Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:46:51.0874772Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:46:51.0875266Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:46:51.0875622Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:46:51.0876214Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:46:51.0876567Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:46:51.0877055Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:46:51.0877320Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:46:51.0877639Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:46:51.0877976Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:46:51.0878282Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:51.0878578Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:46:51.0878883Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:46:51.0879167Z #define CLOCK_BOOTTIME 7 2025-05-07T19:46:51.0879450Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:46:51.0879725Z #define CLOCK_MONOTONIC 1 2025-05-07T19:46:51.0880036Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:46:51.0880366Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:46:51.0880671Z #define CLOCK_PROCESS_CPUTIME_ID 2 
2025-05-07T19:46:51.0881027Z #define CLOCK_REALTIME 0 2025-05-07T19:46:51.0881312Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:46:51.0881635Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:46:51.0881913Z #define CLOCK_TAI 11 2025-05-07T19:46:51.0882181Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:46:51.0882487Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:46:51.0882768Z #define CUDARTAPI 2025-05-07T19:46:51.0883001Z #define CUDARTAPI_CDECL 2025-05-07T19:46:51.0883263Z #define CUDART_CB 2025-05-07T19:46:51.0883532Z #define CUDART_DEVICE __device__ 2025-05-07T19:46:51.0883856Z #define CUDART_VERSION 12060 2025-05-07T19:46:51.0884209Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:46:51.0884536Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:46:51.0884870Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:46:51.0885180Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:46:51.0885473Z #define DOMAIN 1 2025-05-07T19:46:51.0885696Z #define EOF (-1) 2025-05-07T19:46:51.0885943Z #define EXIT_FAILURE 1 2025-05-07T19:46:51.0886195Z #define EXIT_SUCCESS 0 2025-05-07T19:46:51.0886486Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:46:51.0886858Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:46:51.0887231Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:46:51.0887630Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:46:51.0887979Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:46:51.0888331Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:46:51.0888669Z #define FILENAME_MAX 4096 2025-05-07T19:46:51.0888975Z #define FOPEN_MAX 16 2025-05-07T19:46:51.0889256Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:46:51.0889601Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:46:51.0889929Z #define FP_INFINITE 1 2025-05-07T19:46:51.0890180Z #define FP_NAN 0 2025-05-07T19:46:51.0890451Z #define FP_NORMAL 4 2025-05-07T19:46:51.0890743Z #define FP_SUBNORMAL 3 2025-05-07T19:46:51.0891025Z #define FP_ZERO 2 
2025-05-07T19:46:51.0891269Z #define HOST_NAME_MAX 64 2025-05-07T19:46:51.0891571Z #define HUGE 3.40282347e+38F 2025-05-07T19:46:51.0891869Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:46:51.0917031Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:46:51.0917413Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:46:51.0917725Z #define INFINITY (__builtin_inff()) 2025-05-07T19:46:51.0918039Z #define INT_MAX __INT_MAX__ 2025-05-07T19:46:51.0918334Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:46:51.0918602Z #define IOV_MAX 1024 2025-05-07T19:46:51.0919229Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:46:51.0919767Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:46:51.0920078Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:51.0920373Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:51.0920689Z #define LOGIN_NAME_MAX 256 2025-05-07T19:46:51.0920937Z #define LONG_BIT 64 2025-05-07T19:46:51.0921198Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:46:51.0921537Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:46:51.0921966Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:46:51.0922256Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:46:51.0922534Z #define L_ctermid 9 2025-05-07T19:46:51.0922771Z #define L_cuserid 9 2025-05-07T19:46:51.0922987Z #define L_tmpnam 20 2025-05-07T19:46:51.0923233Z #define MATH_ERREXCEPT 2 2025-05-07T19:46:51.0923475Z #define MATH_ERRNO 1 2025-05-07T19:46:51.0923722Z #define MAX_CANON 255 2025-05-07T19:46:51.0923949Z #define MAX_INPUT 255 2025-05-07T19:46:51.0924228Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:46:51.0924526Z #define MB_LEN_MAX 16 2025-05-07T19:46:51.0924788Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:46:51.0925083Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:46:51.0925332Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:46:51.0925619Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:46:51.0925902Z #define MOD_MAXERROR ADJ_MAXERROR 
2025-05-07T19:46:51.0926186Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:46:51.0926436Z #define MOD_NANO ADJ_NANO 2025-05-07T19:46:51.0926699Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:46:51.0926955Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:46:51.0927223Z #define MOD_TAI ADJ_TAI 2025-05-07T19:46:51.0927468Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:46:51.0927759Z #define MQ_PRIO_MAX 32768 2025-05-07T19:46:51.0928022Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:46:51.0928335Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:46:51.0928675Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:46:51.0928979Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:46:51.0929327Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:46:51.0929660Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:46:51.0930015Z #define M_E 2.7182818284590452354 2025-05-07T19:46:51.0930303Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:46:51.0930641Z #define M_LN10 2.30258509299404568402 2025-05-07T19:46:51.0930946Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:46:51.0931276Z #define M_LN2 0.69314718055994530942 2025-05-07T19:46:51.0931565Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:46:51.0931880Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:46:51.0932193Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:46:51.0932504Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:46:51.0932807Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:46:51.0933112Z #define M_PI 3.14159265358979323846 2025-05-07T19:46:51.0933383Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:46:51.0933686Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:46:51.0934020Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:46:51.0934316Z #define M_PI_4l 0.785398163397448309615660845819875721L 
2025-05-07T19:46:51.0934657Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:46:51.0934975Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:46:51.0935293Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:46:51.0935640Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:46:51.0935950Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:46:51.0936274Z #define NAME_MAX 255 2025-05-07T19:46:51.0936504Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:46:51.0936778Z #define NFDBITS __NFDBITS 2025-05-07T19:46:51.0937013Z #define NGROUPS_MAX 65536 2025-05-07T19:46:51.0937268Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:46:51.0937548Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:46:51.0937923Z #define NL_MSGMAX INT_MAX 2025-05-07T19:46:51.0938174Z #define NL_NMAX INT_MAX 2025-05-07T19:46:51.0938396Z #define NL_SETMAX INT_MAX 2025-05-07T19:46:51.0938626Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:46:51.0938847Z #define NULL __null 2025-05-07T19:46:51.0939068Z #define NZERO 20 2025-05-07T19:46:51.0939267Z #define OVERFLOW 3 2025-05-07T19:46:51.0939489Z #define PATH_MAX 4096 2025-05-07T19:46:51.0939813Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:46:51.0940075Z #define PIPE_BUF 4096 2025-05-07T19:46:51.0940434Z #define PLOSS 6 2025-05-07T19:46:51.0940963Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:46:51.0941418Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:46:51.0941707Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:46:51.0941997Z #define P_tmpdir "/tmp" 2025-05-07T19:46:51.0942239Z #define RAND_MAX 2147483647 2025-05-07T19:46:51.0942517Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:46:51.0942774Z #define RTSIG_MAX 32 2025-05-07T19:46:51.0943044Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:46:51.0943317Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:46:51.0943625Z #define SEEK_CUR 1 2025-05-07T19:46:51.0943845Z #define SEEK_DATA 3 
2025-05-07T19:46:51.0944074Z #define SEEK_END 2 2025-05-07T19:46:51.0944288Z #define SEEK_HOLE 4 2025-05-07T19:46:51.0944491Z #define SEEK_SET 0 2025-05-07T19:46:51.0944736Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:46:51.0945027Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:46:51.0945321Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:46:51.0945606Z #define SING 2 2025-05-07T19:46:51.0945848Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:46:51.0946108Z #define STA_CLK 0x8000 2025-05-07T19:46:51.0946369Z #define STA_CLOCKERR 0x1000 2025-05-07T19:46:51.0946633Z #define STA_DEL 0x0020 2025-05-07T19:46:51.0946889Z #define STA_FLL 0x0008 2025-05-07T19:46:51.0947133Z #define STA_FREQHOLD 0x0080 2025-05-07T19:46:51.0947411Z #define STA_INS 0x0010 2025-05-07T19:46:51.0947668Z #define STA_MODE 0x4000 2025-05-07T19:46:51.0947918Z #define STA_NANO 0x2000 2025-05-07T19:46:51.0948182Z #define STA_PLL 0x0001 2025-05-07T19:46:51.0948436Z #define STA_PPSERROR 0x0800 2025-05-07T19:46:51.0948723Z #define STA_PPSFREQ 0x0002 2025-05-07T19:46:51.0948988Z #define STA_PPSJITTER 0x0200 2025-05-07T19:46:51.0949279Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:46:51.0949550Z #define STA_PPSTIME 0x0004 2025-05-07T19:46:51.0949832Z #define STA_PPSWANDER 0x0400 2025-05-07T19:46:51.0950408Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:46:51.0951021Z #define STA_UNSYNC 0x0040 2025-05-07T19:46:51.0951289Z #define TIMER_ABSTIME 1 2025-05-07T19:46:51.0951529Z #define TIME_UTC 1 2025-05-07T19:46:51.0951767Z #define TLOSS 5 2025-05-07T19:46:51.0951989Z #define TMP_MAX 238328 2025-05-07T19:46:51.0952249Z #define TTY_NAME_MAX 32 2025-05-07T19:46:51.0952507Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:46:51.0952830Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:46:51.0953274Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:51.0953642Z #define ULONG_LONG_MAX 
(__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:46:51.0953976Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:46:51.0954276Z #define UNDERFLOW 4 2025-05-07T19:46:51.0954522Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:46:51.0954801Z #define WCONTINUED 8 2025-05-07T19:46:51.0955047Z #define WEXITED 4 2025-05-07T19:46:51.0955350Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:46:51.0955825Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:46:51.0956261Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:46:51.0956704Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:46:51.0957139Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:46:51.0957506Z #define WNOHANG 1 2025-05-07T19:46:51.0957743Z #define WNOWAIT 0x01000000 2025-05-07T19:46:51.0958107Z #define WORD_BIT 32 2025-05-07T19:46:51.0958345Z #define WSTOPPED 2 2025-05-07T19:46:51.0958630Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:46:51.0959055Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:46:51.0959394Z #define WUNTRACED 2 2025-05-07T19:46:51.0959642Z #define XATTR_LIST_MAX 65536 2025-05-07T19:46:51.0959891Z #define XATTR_NAME_MAX 255 2025-05-07T19:46:51.0960221Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:46:51.0960489Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:46:51.0960784Z #define _ACRTIMP 2025-05-07T19:46:51.0961019Z #define _ALLOCA_H 1 2025-05-07T19:46:51.0961239Z #define _ASSERT_H 1 2025-05-07T19:46:51.0961474Z #define _ATFILE_SOURCE 1 2025-05-07T19:46:51.0961717Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:46:51.0961991Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:46:51.0962245Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:46:51.0962522Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:46:51.0962779Z #define _BITS_TIMEX_H 1 2025-05-07T19:46:51.0963035Z #define _BITS_TIME_H 1 
2025-05-07T19:46:51.0963272Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:46:51.0963542Z #define _BITS_TYPES_H 1 2025-05-07T19:46:51.0963795Z #define _BSD_SOURCE 1 2025-05-07T19:46:51.0964028Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:46:51.0964295Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:46:51.0964541Z #define _CRTIMP 2025-05-07T19:46:51.0964776Z #define _CTYPE_H 1 2025-05-07T19:46:51.0964991Z #define _ENDIAN_H 1 2025-05-07T19:46:51.0965227Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:46:51.0965485Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:46:51.0965751Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:46:51.0965986Z #define _FEATURES_H 1 2025-05-07T19:46:51.0966227Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:46:51.0966461Z #define _GCC_LIMITS_H_ 2025-05-07T19:46:51.0966745Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:46:51.0967204Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.0967629Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:46:51.0967928Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:46:51.0968195Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:46:51.0968483Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:46:51.0968756Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:46:51.0969051Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:46:51.0969360Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:46:51.0969700Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:46:51.0970157Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.0970577Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:46:51.0970864Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:46:51.0971129Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:46:51.0971444Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:46:51.0971749Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:46:51.0972040Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:46:51.0972312Z #define 
_GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:46:51.0972601Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:46:51.0972893Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:46:51.0973248Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:46:51.0973644Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:46:51.0973939Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:46:51.0974273Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:46:51.0974578Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:46:51.0974956Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:46:51.0975311Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:46:51.0975730Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:46:51.0976941Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:46:51.0977434Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:46:51.0977842Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:46:51.0978340Z #define _GLIBCXX_CMATH 1 2025-05-07T19:46:51.0978651Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:46:51.0979003Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:46:51.0979329Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:46:51.0979606Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:46:51.0979888Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:46:51.0980372Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:46:51.0980734Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:46:51.0981080Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:46:51.0981459Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:46:51.0981800Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:46:51.0982138Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:46:51.0982538Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:46:51.0982966Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:46:51.0983562Z #define 
_GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:46:51.0984113Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:46:51.0984431Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:46:51.0984731Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:46:51.0985042Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:46:51.0985387Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:46:51.0985699Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:46:51.0986127Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:46:51.0986565Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:46:51.0986896Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:46:51.0987194Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:46:51.0987501Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:46:51.0987930Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:46:51.0988322Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:46:51.0988670Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:46:51.0988970Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:46:51.0989896Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template> struct __has_##_NTYPE : false_type { }; template struct __has_##_NTYPE<_Tp, __void_t> : true_type { }; 2025-05-07T19:46:51.0990842Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:46:51.0991124Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:46:51.0991422Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:46:51.0991726Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:46:51.0992038Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:46:51.0992315Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:46:51.0992631Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:46:51.0993070Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:46:51.0993349Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:46:51.0993627Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:46:51.0993882Z #define 
_GLIBCXX_HAVE_ATANL 1 2025-05-07T19:46:51.0994176Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:46:51.0994498Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:46:51.0994828Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:46:51.0995138Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:46:51.0995484Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:46:51.0995830Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:46:51.0996181Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:46:51.0996487Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:46:51.0996771Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:46:51.0997041Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:46:51.0997301Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:46:51.0997587Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:46:51.0997839Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:46:51.0998110Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:46:51.0998366Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:46:51.1000392Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:46:51.1000658Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:46:51.1000937Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:46:51.1001254Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:46:51.1001589Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:46:51.1001890Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:46:51.1002150Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:46:51.1002501Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:46:51.1002762Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:46:51.1003038Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:46:51.1003298Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:46:51.1003580Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:46:51.1003837Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:46:51.1004113Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:46:51.1004385Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:46:51.1004641Z #define 
_GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:46:51.1004916Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:46:51.1005178Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:46:51.1005449Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:46:51.1005706Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:46:51.1005982Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:46:51.1006243Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:46:51.1006687Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:46:51.1006940Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:46:51.1007213Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:46:51.1007481Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:46:51.1007735Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:46:51.1008006Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:46:51.1008268Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:46:51.1008567Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:46:51.1008839Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:46:51.1009111Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:46:51.1009365Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:46:51.1009635Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:46:51.1009894Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:46:51.1010162Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:46:51.1010438Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:46:51.1010711Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:46:51.1010999Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:46:51.1011252Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:46:51.1011533Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:46:51.1011796Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:46:51.1012080Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:46:51.1012351Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:46:51.1012638Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:46:51.1012904Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:46:51.1013173Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:46:51.1013467Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 
2025-05-07T19:46:51.1013755Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:46:51.1014053Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:46:51.1014320Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:46:51.1014589Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:46:51.1014846Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:46:51.1015109Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:46:51.1015363Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:46:51.1015647Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:46:51.1015908Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:46:51.1016190Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:46:51.1016460Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:46:51.1016717Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:46:51.1016989Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:46:51.1017250Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:46:51.1017550Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:46:51.1017839Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:46:51.1018147Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:46:51.1018435Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:46:51.1018708Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:46:51.1019118Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:46:51.1019420Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:46:51.1019731Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:46:51.1019987Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:46:51.1020372Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:46:51.1020829Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:46:51.1021213Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:46:51.1021490Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:46:51.1021778Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:46:51.1022055Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:46:51.1022349Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:46:51.1022638Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:46:51.1022908Z #define _GLIBCXX_HAVE_SINHL 1 
2025-05-07T19:46:51.1023194Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:46:51.1023468Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:46:51.1023774Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:46:51.1024043Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:46:51.1024341Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:46:51.1024632Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:46:51.1024935Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:46:51.1025219Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:46:51.1025518Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:46:51.1025820Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:46:51.1026110Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:46:51.1026413Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:46:51.1026691Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:46:51.1026978Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:46:51.1027275Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:46:51.1027615Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:46:51.1027901Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:46:51.1028269Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:46:51.1028664Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:46:51.1028982Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:46:51.1029287Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:46:51.1029589Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:46:51.1029909Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:46:51.1030200Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:46:51.1030516Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:46:51.1030829Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:46:51.1031143Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:46:51.1031441Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:46:51.1031750Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:46:51.1032040Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:46:51.1032340Z #define _GLIBCXX_HAVE_S_ISREG 1 
2025-05-07T19:46:51.1032631Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:46:51.1032906Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:46:51.1033299Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:46:51.1033550Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:46:51.1033819Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:46:51.1034087Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:46:51.1034357Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:46:51.1034620Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:46:51.1034904Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:46:51.1035173Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:46:51.1035448Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:46:51.1035730Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:46:51.1035990Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:46:51.1036262Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:46:51.1036516Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:46:51.1036787Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:46:51.1037047Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:46:51.1037319Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:46:51.1037581Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:46:51.1037836Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:46:51.1038094Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:46:51.1038487Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:46:51.1038983Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:46:51.1039576Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:46:51.1039996Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:46:51.1040260Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:46:51.1040611Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:46:51.1040979Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:46:51.1041461Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:46:51.1041930Z #define 
_GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:46:51.1042234Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:46:51.1042607Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:46:51.1043144Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:46:51.1043644Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:46:51.1043974Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:46:51.1044301Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:46:51.1044672Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:46:51.1044996Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:46:51.1045380Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:46:51.1045746Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:46:51.1046176Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:46:51.1046585Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:46:51.1046862Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:46:51.1047153Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:46:51.1047466Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:46:51.1047878Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:46:51.1048268Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:46:51.1048603Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:46:51.1048929Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:46:51.1049297Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:46:51.1049586Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:46:51.1049922Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:46:51.1050254Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:46:51.1050505Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:46:51.1050774Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:46:51.1051033Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:46:51.1051304Z 
#define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:46:51.1051569Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:46:51.1051844Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:46:51.1052087Z #define _GLIBCXX_STD_A std 2025-05-07T19:46:51.1052340Z #define _GLIBCXX_STD_C std 2025-05-07T19:46:51.1052595Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:46:51.1052833Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:46:51.1053149Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:46:51.1053514Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:46:51.1053854Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:46:51.1054149Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:46:51.1054507Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:46:51.1054823Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:46:51.1055142Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:46:51.1055436Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:46:51.1055737Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:46:51.1056033Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:46:51.1056345Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:46:51.1056683Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:46:51.1056999Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:46:51.1057294Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:46:51.1057684Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:46:51.1058011Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:46:51.1058310Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:46:51.1058568Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:46:51.1058829Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:46:51.1059080Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:46:51.1059412Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:46:51.1059703Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:46:51.1060046Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 
2025-05-07T19:46:51.1060427Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:46:51.1060878Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:46:51.1061169Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:46:51.1061501Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:46:51.1061857Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:46:51.1062200Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:46:51.1062511Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:46:51.1062843Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:46:51.1063259Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:46:51.1063655Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:46:51.1064014Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:46:51.1064316Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:46:51.1064620Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:46:51.1064925Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:46:51.1065208Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:46:51.1065514Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:46:51.1065791Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:46:51.1066076Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:46:51.1066335Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:46:51.1066597Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:46:51.1066854Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:46:51.1067133Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:46:51.1067412Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:46:51.1067687Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:46:51.1067948Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:46:51.1068200Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:46:51.1068474Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:46:51.1068732Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:46:51.1069024Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:46:51.1069323Z #define 
_GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:46:51.1069628Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:46:51.1069906Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:46:51.1070178Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:46:51.1070472Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:46:51.1070785Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:46:51.1071064Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:46:51.1071354Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:46:51.1071720Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:46:51.1072109Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:46:51.1072401Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:46:51.1072675Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:46:51.1072960Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:46:51.1073336Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:46:51.1073640Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:46:51.1073879Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:46:51.1074212Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:46:51.1074599Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:46:51.1074869Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:46:51.1075129Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:46:51.1075363Z #define _GNU_SOURCE 1 2025-05-07T19:46:51.1075603Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:46:51.1075872Z #define _G_BUFSIZ 8192 2025-05-07T19:46:51.1076542Z #define _G_HAVE_MMAP 1 2025-05-07T19:46:51.1077074Z #define _G_HAVE_MREMAP 1 2025-05-07T19:46:51.1077386Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:46:51.1077732Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:46:51.1078037Z #define _G_config_h 1 2025-05-07T19:46:51.1078285Z #define _G_va_list __gnuc_va_list 2025-05-07T19:46:51.1078552Z #define _INITIALIZER_LIST 2025-05-07T19:46:51.1078806Z #define _IOFBF 0 2025-05-07T19:46:51.1079097Z #define _IOLBF 1 2025-05-07T19:46:51.1079313Z #define _IONBF 2 
2025-05-07T19:46:51.1079541Z #define _IOS_APPEND 8 2025-05-07T19:46:51.1079771Z #define _IOS_ATEND 4 2025-05-07T19:46:51.1079987Z #define _IOS_BIN 128 2025-05-07T19:46:51.1080227Z #define _IOS_INPUT 1 2025-05-07T19:46:51.1080450Z #define _IOS_NOCREATE 32 2025-05-07T19:46:51.1080709Z #define _IOS_NOREPLACE 64 2025-05-07T19:46:51.1080945Z #define _IOS_OUTPUT 2 2025-05-07T19:46:51.1081179Z #define _IOS_TRUNC 16 2025-05-07T19:46:51.1081422Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:46:51.1081728Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:46:51.1082079Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:46:51.1082343Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:46:51.1082628Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:46:51.1082902Z #define _IO_DEC 020 2025-05-07T19:46:51.1083149Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:46:51.1083435Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:46:51.1083710Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:46:51.1083954Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:46:51.1084223Z #define _IO_FIXED 010000 2025-05-07T19:46:51.1084483Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:46:51.1084739Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:46:51.1085029Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:46:51.1085326Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:46:51.1085660Z #define _IO_HEX 0100 2025-05-07T19:46:51.1085897Z #define _IO_INTERNAL 010 2025-05-07T19:46:51.1086157Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:46:51.1086437Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:46:51.1086765Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:46:51.1087043Z #define _IO_LEFT 02 2025-05-07T19:46:51.1087320Z #define _IO_LINE_BUF 0x200 2025-05-07T19:46:51.1087619Z #define _IO_LINKED 0x80 2025-05-07T19:46:51.1087887Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:46:51.1088200Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:46:51.1088497Z #define _IO_NO_READS 4 2025-05-07T19:46:51.1088883Z #define _IO_NO_WRITES 8 
2025-05-07T19:46:51.1089124Z #define _IO_OCT 040 2025-05-07T19:46:51.1089523Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:46:51.1089958Z #define _IO_RIGHT 04 2025-05-07T19:46:51.1090227Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:46:51.1090495Z #define _IO_SHOWBASE 0200 2025-05-07T19:46:51.1090778Z #define _IO_SHOWPOINT 0400 2025-05-07T19:46:51.1091061Z #define _IO_SHOWPOS 02000 2025-05-07T19:46:51.1091309Z #define _IO_SKIPWS 01 2025-05-07T19:46:51.1091572Z #define _IO_STDIO 040000 2025-05-07T19:46:51.1091815Z #define _IO_STDIO_H 2025-05-07T19:46:51.1092089Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:46:51.1092360Z #define _IO_UNBUFFERED 2 2025-05-07T19:46:51.1092643Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:46:51.1092918Z #define _IO_UNITBUF 020000 2025-05-07T19:46:51.1093202Z #define _IO_UPPERCASE 01000 2025-05-07T19:46:51.1093458Z #define _IO_USER_BUF 1 2025-05-07T19:46:51.1093727Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:46:51.1094013Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:46:51.1094356Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:46:51.1094777Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:46:51.1095260Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:46:51.1095682Z #define _IO_file_flags _flags 2025-05-07T19:46:51.1095951Z #define _IO_flockfile(_fp) 2025-05-07T19:46:51.1096237Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:46:51.1096511Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:46:51.1096802Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:46:51.1097202Z #define _IO_funlockfile(_fp) 2025-05-07T19:46:51.1097753Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? 
__uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:46:51.1098326Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:46:51.1098592Z #define _IO_off64_t __off64_t 2025-05-07T19:46:51.1098881Z #define _IO_off_t __off_t 2025-05-07T19:46:51.1099240Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:46:51.1099890Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:46:51.1100579Z #define _IO_pid_t __pid_t 2025-05-07T19:46:51.1101453Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:46:51.1102190Z #define _IO_size_t size_t 2025-05-07T19:46:51.1102481Z #define _IO_ssize_t __ssize_t 2025-05-07T19:46:51.1102844Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:46:51.1103225Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:46:51.1103622Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:46:51.1103967Z #define _IO_uid_t __uid_t 2025-05-07T19:46:51.1104271Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:46:51.1104604Z #define _IO_wint_t wint_t 2025-05-07T19:46:51.1104882Z #define _ISOC11_SOURCE 1 2025-05-07T19:46:51.1105173Z #define _ISOC95_SOURCE 1 2025-05-07T19:46:51.1105440Z #define _ISOC99_SOURCE 1 2025-05-07T19:46:51.1105836Z #define _ISbit(bit) ((bit) < 8 ? 
((1 << (bit)) << 8) : ((1 << (bit)) >> 8)) 2025-05-07T19:46:51.1106249Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:46:51.1106566Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:46:51.1106851Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:46:51.1107149Z #define _LINUX_LIMITS_H 2025-05-07T19:46:51.1107417Z #define _LP64 1 2025-05-07T19:46:51.1107883Z #define _MATH_H 1 2025-05-07T19:46:51.1108129Z #define _MATH_H_MATHDEF 1 2025-05-07T19:46:51.1108420Z #define _MOVE_H 1 2025-05-07T19:46:51.1108689Z #define _Mfloat_ float 2025-05-07T19:46:51.1108963Z #define _Mlong_double_ long double 2025-05-07T19:46:51.1109297Z #define _NEW 2025-05-07T19:46:51.1109550Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:46:51.1109899Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:46:51.1110206Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:46:51.1110538Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:46:51.1110842Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:46:51.1111196Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:46:51.1111526Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:46:51.1111872Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:46:51.1112207Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:46:51.1112506Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:46:51.1112832Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:46:51.1113231Z #define _POSIX_AIO_MAX 1 2025-05-07T19:46:51.1113523Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:46:51.1113784Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:46:51.1114073Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:46:51.1114363Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:46:51.1114660Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:46:51.1114950Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:46:51.1115287Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:46:51.1115605Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:46:51.1115876Z #define _POSIX_LINK_MAX 8 2025-05-07T19:46:51.1116155Z #define _POSIX_LOGIN_NAME_MAX 9 
2025-05-07T19:46:51.1116426Z #define _POSIX_MAX_CANON 255 2025-05-07T19:46:51.1116720Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:46:51.1116984Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:46:51.1117278Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:46:51.1117544Z #define _POSIX_NAME_MAX 14 2025-05-07T19:46:51.1117809Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:46:51.1118063Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:46:51.1118425Z #define _POSIX_PATH_MAX 256 2025-05-07T19:46:51.1118687Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:46:51.1118927Z #define _POSIX_QLIMIT 1 2025-05-07T19:46:51.1119175Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:46:51.1119425Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:46:51.1119689Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:46:51.1119953Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:46:51.1120302Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:46:51.1120553Z #define _POSIX_SOURCE 1 2025-05-07T19:46:51.1120807Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:46:51.1121059Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:46:51.1121322Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:46:51.1121589Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:46:51.1121868Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:46:51.1122201Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:46:51.1122481Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:46:51.1122777Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:46:51.1123029Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:46:51.1123297Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:46:51.1123541Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:46:51.1123870Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:46:51.1124331Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:46:51.1124920Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:46:51.1125392Z #define _PSTL_CONFIG_H 
2025-05-07T19:46:51.1125827Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:46:51.1126644Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:46:51.1127397Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:46:51.1128142Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:46:51.1129068Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:46:51.1129771Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:46:51.1130214Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:51.1130685Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:46:51.1131124Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:46:51.1131407Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:46:51.1131743Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:46:51.1132165Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:46:51.1132508Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:46:51.1132799Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:46:51.1133417Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:46:51.1134124Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:46:51.1134492Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:46:51.1134834Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:46:51.1135192Z #define _PSTL_PRAGMA_MESSAGE(x) 
2025-05-07T19:46:51.1135664Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:46:51.1136194Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:46:51.1136520Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:46:51.1136854Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:46:51.1137157Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:46:51.1137510Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:46:51.1137946Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:46:51.1138337Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:46:51.1138829Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:46:51.1139259Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:46:51.1139590Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:46:51.1139954Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:46:51.1140394Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:46:51.1140857Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:46:51.1141210Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:46:51.1141709Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:46:51.1142223Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:46:51.1142584Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:46:51.1142946Z #define _PSTL_VERSION 12000 2025-05-07T19:46:51.1143299Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:46:51.1143717Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:46:51.1144153Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:46:51.1144499Z #define _PTRDIFF_T 2025-05-07T19:46:51.1144784Z #define _PTR_TRAITS_H 1 2025-05-07T19:46:51.1145089Z #define _SIGSET_H_types 1 2025-05-07T19:46:51.1145451Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 
2025-05-07T19:46:51.1145882Z #define _SIZE_T 2025-05-07T19:46:51.1146145Z #define _STDC_PREDEF_H 1 2025-05-07T19:46:51.1146451Z #define _STDIO_H 1 2025-05-07T19:46:51.1146713Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:46:51.1147029Z #define _STDLIB_H 1 2025-05-07T19:46:51.1147286Z #define _STL_ALGOBASE_H 1 2025-05-07T19:46:51.1147610Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:46:51.1147936Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:46:51.1148289Z #define _STL_ITERATOR_H 1 2025-05-07T19:46:51.1148568Z #define _STL_PAIR_H 1 2025-05-07T19:46:51.1148855Z #define _STL_RELOPS_H 1 2025-05-07T19:46:51.1149150Z #define _STRING_H 1 2025-05-07T19:46:51.1149409Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:46:51.1149715Z #define _SVID_SOURCE 1 2025-05-07T19:46:51.1149974Z #define _SYS_CDEFS_H 1 2025-05-07T19:46:51.1150251Z #define _SYS_SELECT_H 1 2025-05-07T19:46:51.1150503Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:46:51.1150797Z #define _SYS_TYPES_H 1 2025-05-07T19:46:51.1151042Z #define _TIME_H 1 2025-05-07T19:46:51.1151305Z #define _VA_LIST_DEFINED 2025-05-07T19:46:51.1151567Z #define _XLOCALE_H 1 2025-05-07T19:46:51.1151858Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:46:51.1152201Z #define _XOPEN_LIM_H 1 2025-05-07T19:46:51.1152459Z #define _XOPEN_SOURCE 700 2025-05-07T19:46:51.1152760Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:46:51.1153249Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:46:51.1153712Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:46:51.1154086Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:46:51.1154452Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:46:51.1154761Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:46:51.1155050Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:46:51.1155310Z #define __ATOMIC_CONSUME 1 2025-05-07T19:46:51.1155594Z #define __ATOMIC_RELAXED 0 2025-05-07T19:46:51.1155879Z #define __ATOMIC_RELEASE 3 2025-05-07T19:46:51.1156134Z 
#define __ATOMIC_SEQ_CST 5 2025-05-07T19:46:51.1156427Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:46:51.1156714Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:46:51.1157011Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:46:51.1157279Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:46:51.1157577Z #define __BIG_ENDIAN 4321 2025-05-07T19:46:51.1157843Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:46:51.1158157Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:46:51.1158437Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:51.1158779Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.1159141Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.1159564Z #define __BOOL_WIDTH__ 8 2025-05-07T19:46:51.1159859Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:46:51.1160177Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:46:51.1160536Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:46:51.1160833Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:46:51.1161160Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:46:51.1161557Z #define __CHAR_BIT__ 8 2025-05-07T19:46:51.1161842Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:51.1162186Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:51.1162508Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:51.1162848Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:51.1163155Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:51.1163486Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:51.1163797Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:51.1164132Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:51.1164448Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:51.1164789Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:46:51.1165094Z #define __CLANG_LIMITS_H 2025-05-07T19:46:51.1165376Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:46:51.1165682Z #define __CLOCKID_T_TYPE 
__S32_TYPE 2025-05-07T19:46:51.1165981Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.1166316Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:46:51.1166579Z #define __COMPAR_FN_T 2025-05-07T19:46:51.1166846Z #define __CONCAT(x,y) x ## y 2025-05-07T19:46:51.1167115Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:46:51.1167415Z #define __CUDACC_VER_BUILD__ 85 2025-05-07T19:46:51.1167687Z #define __CUDACC_VER_MAJOR__ 12 2025-05-07T19:46:51.1167980Z #define __CUDACC_VER_MINOR__ 6 2025-05-07T19:46:51.1168589Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:46:51.1169198Z #define __CUDACC__ 1 2025-05-07T19:46:51.1169465Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:46:51.1169756Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:46:51.1170223Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:46:51.1170689Z #define __CUDA_API_VER_MAJOR__ 12 2025-05-07T19:46:51.1171004Z #define __CUDA_API_VER_MINOR__ 6 2025-05-07T19:46:51.1171361Z #define __CUDA_ARCH_HAS_FEATURE__(_FEAT) __CUDA_ARCH_FEAT_##_FEAT 2025-05-07T19:46:51.1171770Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:46:51.1172058Z #define __CUDA_ARCH__ 520 2025-05-07T19:46:51.1172322Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:46:51.1172647Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:46:51.1172907Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:46:51.1173202Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:46:51.1173465Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:46:51.1173766Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:46:51.1174065Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:46:51.1174396Z #define __DBL_DIG__ 15 2025-05-07T19:46:51.1174645Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:46:51.1174961Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:46:51.1175238Z #define __DBL_HAS_INFINITY__ 1 
2025-05-07T19:46:51.1175493Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.1175768Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:46:51.1176259Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:46:51.1176880Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:46:51.1177352Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:46:51.1177779Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:46:51.1178069Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:46:51.1178377Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:46:51.1178709Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:46:51.1179051Z #define __DELETE_THROW throw() 2025-05-07T19:46:51.1179340Z #define __DEPRECATED 1 2025-05-07T19:46:51.1179602Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1180109Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.1180511Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1180849Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:46:51.1181158Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1181518Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:46:51.1181895Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:46:51.1182223Z #define __DEVICE_TYPES_H__ 2025-05-07T19:46:51.1182513Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.1182796Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:46:51.1183082Z #define __DRIVER_TYPES_H__ 2025-05-07T19:46:51.1183330Z #define __ELF__ 1 2025-05-07T19:46:51.1183564Z #define __END_DECLS } 2025-05-07T19:46:51.1183806Z #define __END_NAMESPACE_C99 2025-05-07T19:46:51.1184086Z #define __END_NAMESPACE_STD 2025-05-07T19:46:51.1184347Z #define __EXCEPTIONS 1 2025-05-07T19:46:51.1184603Z #define __EXCEPTION_H 1 2025-05-07T19:46:51.1184861Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:46:51.1185304Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:46:51.1185749Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:46:51.1186150Z #define 
__FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:46:51.1186629Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:46:51.1187093Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:46:51.1187528Z #define __FD_SETSIZE 1024 2025-05-07T19:46:51.1188216Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:46:51.1188964Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:46:51.1189257Z #define __FILE_defined 1 2025-05-07T19:46:51.1189512Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:46:51.1189797Z #define __FLOAT128__ 1 2025-05-07T19:46:51.1190051Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:46:51.1190370Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:46:51.1190683Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:46:51.1191025Z #define __FLT16_DIG__ 3 2025-05-07T19:46:51.1191279Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:46:51.1191600Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:46:51.1191873Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:46:51.1192176Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.1192473Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:46:51.1192740Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:46:51.1193127Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:46:51.1193378Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:46:51.1193661Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:46:51.1193925Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:46:51.1194196Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:46:51.1194477Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:46:51.1194760Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:46:51.1195055Z #define __FLT_DIG__ 6 2025-05-07T19:46:51.1195288Z #define 
__FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:46:51.1195581Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:46:51.1195829Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:46:51.1196099Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.1196361Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:46:51.1196619Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:46:51.1196869Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:46:51.1197133Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:46:51.1197401Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:46:51.1197678Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:46:51.1197949Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:46:51.1198205Z #define __FLT_RADIX__ 2 2025-05-07T19:46:51.1198458Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.1198769Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.1199207Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.1199516Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.1199853Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:46:51.1200170Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.1200468Z #define __FXSR__ 1 2025-05-07T19:46:51.1200688Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:46:51.1201037Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:46:51.1201344Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:46:51.1201642Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:46:51.1201950Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:46:51.1202232Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:46:51.1202528Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:46:51.1202815Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:46:51.1203126Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:46:51.1203426Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:46:51.1203740Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:46:51.1204057Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 
2025-05-07T19:46:51.1204342Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:46:51.1214530Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:46:51.1214858Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:46:51.1215174Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:46:51.1215489Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:46:51.1215790Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.1216047Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:46:51.1216337Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:46:51.1216601Z #define __GLIBCXX__ 20230528 2025-05-07T19:46:51.1216847Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:46:51.1217093Z #define __GLIBC_MINOR__ 17 2025-05-07T19:46:51.1217477Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:51.1217902Z #define __GLIBC__ 2 2025-05-07T19:46:51.1218106Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:46:51.1218351Z #define __GNUC_MINOR__ 2 2025-05-07T19:46:51.1218574Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:46:51.1218947Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:46:51.1219340Z #define __GNUC_VA_LIST 2025-05-07T19:46:51.1219562Z #define __GNUC__ 4 2025-05-07T19:46:51.1219756Z #define __GNUG__ 4 2025-05-07T19:46:51.1219964Z #define __GNU_LIBRARY__ 6 2025-05-07T19:46:51.1220200Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:46:51.1220575Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:46:51.1221024Z #define __GXX_RTTI 1 2025-05-07T19:46:51.1221240Z #define __GXX_WEAK__ 1 2025-05-07T19:46:51.1221477Z #define __HAVE_COLUMN 2025-05-07T19:46:51.1221779Z #define __HOST_CONFIG_H__ 2025-05-07T19:46:51.1222035Z #define __HOST_DEFINES_H__ 2025-05-07T19:46:51.1222284Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.1222559Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.1222845Z #define __INO_T_MATCHES_INO64_T 1 
2025-05-07T19:46:51.1223143Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.1223439Z #define __INT16_C_SUFFIX__ 2025-05-07T19:46:51.1223697Z #define __INT16_FMTd__ "hd" 2025-05-07T19:46:51.1223952Z #define __INT16_FMTi__ "hi" 2025-05-07T19:46:51.1224193Z #define __INT16_MAX__ 32767 2025-05-07T19:46:51.1224459Z #define __INT16_TYPE__ short 2025-05-07T19:46:51.1224710Z #define __INT32_C_SUFFIX__ 2025-05-07T19:46:51.1224966Z #define __INT32_FMTd__ "d" 2025-05-07T19:46:51.1225204Z #define __INT32_FMTi__ "i" 2025-05-07T19:46:51.1225462Z #define __INT32_MAX__ 2147483647 2025-05-07T19:46:51.1225728Z #define __INT32_TYPE__ int 2025-05-07T19:46:51.1225987Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:46:51.1226239Z #define __INT64_FMTd__ "ld" 2025-05-07T19:46:51.1226499Z #define __INT64_FMTi__ "li" 2025-05-07T19:46:51.1226768Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:46:51.1227062Z #define __INT64_TYPE__ long int 2025-05-07T19:46:51.1227560Z #define __INT8_C_SUFFIX__ 2025-05-07T19:46:51.1227801Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:46:51.1228046Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:46:51.1228287Z #define __INT8_MAX__ 127 2025-05-07T19:46:51.1228540Z #define __INT8_TYPE__ signed char 2025-05-07T19:46:51.1228805Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:46:51.1229069Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:46:51.1229388Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:46:51.1229668Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:46:51.1229966Z #define __INTMAX_TYPE__ long int 2025-05-07T19:46:51.1230236Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:46:51.1230478Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:46:51.1230743Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:46:51.1231008Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:46:51.1231317Z #define __INTPTR_TYPE__ long int 2025-05-07T19:46:51.1231589Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:46:51.1231837Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:46:51.1232124Z 
#define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:46:51.1232384Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:46:51.1232495Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:46:51.1232584Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:46:51.1232674Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:46:51.1232763Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:46:51.1232878Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:46:51.1233085Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:46:51.1233170Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:46:51.1233265Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:46:51.1233354Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:46:51.1233459Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:46:51.1233549Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:46:51.1233641Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:46:51.1233725Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:46:51.1233807Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:46:51.1233904Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:46:51.1233997Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:46:51.1234083Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:46:51.1234168Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:46:51.1234262Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:46:51.1234349Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:46:51.1234442Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:46:51.1234532Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:46:51.1234617Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:46:51.1234700Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:46:51.1234789Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:46:51.1234879Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:46:51.1234961Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:46:51.1235046Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:46:51.1235142Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:46:51.1235249Z #define __INT_LEAST64_MAX__ 
9223372036854775807L 2025-05-07T19:46:51.1235344Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:46:51.1235429Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:46:51.1235523Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:46:51.1235613Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:46:51.1235700Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:46:51.1235798Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:46:51.1235886Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:46:51.1235970Z #define __INT_MAX__ 2147483647 2025-05-07T19:46:51.1236054Z #define __INT_WIDTH__ 32 2025-05-07T19:46:51.1236147Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:46:51.1236242Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:46:51.1236328Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:46:51.1236473Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:46:51.1236552Z #define __LDBL_DIG__ 18 2025-05-07T19:46:51.1236669Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:46:51.1236757Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:46:51.1236922Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:46:51.1237014Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:46:51.1237102Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:46:51.1237194Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:46:51.1237276Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:46:51.1237384Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:46:51.1237824Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:46:51.1237917Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:46:51.1238020Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:46:51.1238126Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:46:51.1238263Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:46:51.1238423Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:46:51.1238514Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:46:51.1238661Z #define 
__LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:46:51.1238740Z #define __LEAF 2025-05-07T19:46:51.1238824Z #define __LEAF_ATTR 2025-05-07T19:46:51.1238913Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:46:51.1239009Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:46:51.1239093Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:46:51.1239177Z #define __LLONG_WIDTH__ 64 2025-05-07T19:46:51.1239298Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:46:51.1239397Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:46:51.1239493Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:46:51.1239580Z #define __LONG_WIDTH__ 64 2025-05-07T19:46:51.1239668Z #define __LP64__ 1 2025-05-07T19:46:51.1239975Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:46:51.1240578Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:46:51.1240682Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:46:51.1240775Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1240864Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:46:51.1240952Z #define __MMX__ 1 2025-05-07T19:46:51.1241041Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:46:51.1241125Z #define __N(msgid) (msgid) 2025-05-07T19:46:51.1241238Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:46:51.1241359Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.1241443Z #define __NO_CTYPE 1 2025-05-07T19:46:51.1241521Z #define __NO_INLINE__ 1 2025-05-07T19:46:51.1241625Z #define __NO_MATH_INLINES 1 2025-05-07T19:46:51.1241727Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:46:51.1241827Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:46:51.1241903Z #define __NVCC__ 1 2025-05-07T19:46:51.1242009Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:46:51.1242095Z #define 
__NV_LEGACY_LAUNCH 1 2025-05-07T19:46:51.1242191Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:46:51.1242290Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:46:51.1242386Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:46:51.1242479Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:46:51.1242578Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.1242705Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:46:51.1242801Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:46:51.1242900Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:46:51.1243017Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:46:51.1243123Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:46:51.1243216Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:46:51.1243306Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:46:51.1243406Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:46:51.1243490Z #define __P(args) args 2025-05-07T19:46:51.1243577Z #define __PDP_ENDIAN 3412 2025-05-07T19:46:51.1243666Z #define __PIC__ 2 2025-05-07T19:46:51.1243757Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:46:51.1243836Z #define __PIE__ 2 2025-05-07T19:46:51.1243976Z #define __PMT(args) args 2025-05-07T19:46:51.1244072Z #define __POINTER_WIDTH__ 64 2025-05-07T19:46:51.1244168Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:46:51.1244260Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:46:51.1244375Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:46:51.1244466Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:46:51.1244553Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:46:51.1244690Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:46:51.1244803Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:46:51.1244896Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:46:51.1244983Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:46:51.1245200Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.1245403Z #define 
__REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:46:51.1245642Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.1245909Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:46:51.1246127Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:46:51.1246218Z #define __REGISTER_PREFIX__ 2025-05-07T19:46:51.1246312Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:46:51.1246425Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:46:51.1246516Z #define __S16_TYPE short int 2025-05-07T19:46:51.1246596Z #define __S32_TYPE int 2025-05-07T19:46:51.1246691Z #define __S64_TYPE long int 2025-05-07T19:46:51.1246769Z #define __SCHAR_MAX__ 127 2025-05-07T19:46:51.1246846Z #define __SEG_FS 1 2025-05-07T19:46:51.1246925Z #define __SEG_GS 1 2025-05-07T19:46:51.1247020Z #define __SHRT_MAX__ 32767 2025-05-07T19:46:51.1247101Z #define __SHRT_WIDTH__ 16 2025-05-07T19:46:51.1247197Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:46:51.1247296Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:46:51.1247379Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:46:51.1247471Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:46:51.1247558Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:46:51.1247648Z #define __SIZEOF_INT128__ 16 2025-05-07T19:46:51.1247729Z #define __SIZEOF_INT__ 4 2025-05-07T19:46:51.1247818Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:46:51.1247913Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:46:51.1247994Z #define __SIZEOF_LONG__ 8 2025-05-07T19:46:51.1248080Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:46:51.1248169Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:46:51.1248275Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:46:51.1248367Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:46:51.1248459Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:46:51.1248559Z #define 
__SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:46:51.1248654Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:46:51.1248743Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:46:51.1248840Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:46:51.1248945Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:46:51.1249036Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:46:51.1249118Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:46:51.1249218Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:46:51.1249301Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:46:51.1249385Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:46:51.1249478Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:46:51.1249559Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:46:51.1249642Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:46:51.1249725Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:46:51.1249831Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.1249926Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:46:51.1250008Z #define __SIZE_WIDTH__ 64 2025-05-07T19:46:51.1250101Z #define __SLONG32_TYPE int 2025-05-07T19:46:51.1250190Z #define __SLONGWORD_TYPE long int 2025-05-07T19:46:51.1250286Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1250378Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.1250533Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:46:51.1250623Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:46:51.1250712Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:46:51.1250809Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:46:51.1250903Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1250995Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.1251084Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:46:51.1251229Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:46:51.1251322Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.1251407Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:46:51.1251511Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1251603Z #define 
__SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:46:51.1251694Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:46:51.1251778Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:46:51.1251873Z #define __SM_70_RT_HPP__ 2025-05-07T19:46:51.1251952Z #define __SM_70_RT_H__ 2025-05-07T19:46:51.1252034Z #define __SM_80_RT_HPP__ 2025-05-07T19:46:51.1252123Z #define __SM_80_RT_H__ 2025-05-07T19:46:51.1252204Z #define __SM_90_RT_HPP__ 2025-05-07T19:46:51.1252281Z #define __SM_90_RT_H__ 2025-05-07T19:46:51.1252370Z #define __SQUAD_TYPE long int 2025-05-07T19:46:51.1252457Z #define __SSE2_MATH__ 1 2025-05-07T19:46:51.1252533Z #define __SSE2__ 1 2025-05-07T19:46:51.1252613Z #define __SSE_MATH__ 1 2025-05-07T19:46:51.1252699Z #define __SSE__ 1 2025-05-07T19:46:51.1252797Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:46:51.1252911Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:46:51.1253015Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:46:51.1253115Z #define __STDCPP_THREADS__ 1 2025-05-07T19:46:51.1253200Z #define __STDC_HOSTED__ 1 2025-05-07T19:46:51.1253292Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:46:51.1253384Z #define __STDC_IEC_559__ 1 2025-05-07T19:46:51.1253469Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:46:51.1253556Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:46:51.1253643Z #define __STDC_UTF_16__ 1 2025-05-07T19:46:51.1253736Z #define __STDC_UTF_32__ 1 2025-05-07T19:46:51.1253813Z #define __STDC__ 1 2025-05-07T19:46:51.1253891Z #define __STDDEF_H 2025-05-07T19:46:51.1253983Z #define __STRING(x) #x 2025-05-07T19:46:51.1254085Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:51.1254176Z #define __SURFACE_TYPES_H__ 2025-05-07T19:46:51.1254296Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.1254391Z #define __SWORD_TYPE long int 2025-05-07T19:46:51.1254504Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:46:51.1254612Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:46:51.1254710Z 
#define __SYSCALL_WORDSIZE 64 2025-05-07T19:46:51.1254809Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:46:51.1254896Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:46:51.1254979Z #define __THROW throw () 2025-05-07T19:46:51.1255071Z #define __THROWNL throw () 2025-05-07T19:46:51.1255157Z #define __TIMER_T_TYPE void * 2025-05-07T19:46:51.1255259Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:46:51.1255363Z #define __U16_TYPE unsigned short int 2025-05-07T19:46:51.1255449Z #define __U32_TYPE unsigned int 2025-05-07T19:46:51.1255542Z #define __U64_TYPE unsigned long int 2025-05-07T19:46:51.1255631Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:46:51.1255723Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:46:51.1255804Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:46:51.1255889Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:46:51.1255986Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:46:51.1256069Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:46:51.1256151Z #define __UINT16_MAX__ 65535 2025-05-07T19:46:51.1256247Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:46:51.1256343Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:46:51.1256424Z #define __UINT32_FMTX__ "X" 2025-05-07T19:46:51.1256505Z #define __UINT32_FMTo__ "o" 2025-05-07T19:46:51.1256602Z #define __UINT32_FMTu__ "u" 2025-05-07T19:46:51.1256681Z #define __UINT32_FMTx__ "x" 2025-05-07T19:46:51.1256823Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:46:51.1256913Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:46:51.1257011Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:46:51.1257093Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:46:51.1257179Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:46:51.1257281Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:46:51.1257363Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:46:51.1257511Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.1257613Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:46:51.1257713Z #define __UINT8_C_SUFFIX__ 
2025-05-07T19:46:51.1257796Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:46:51.1257878Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:46:51.1257975Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:46:51.1258058Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:46:51.1258138Z #define __UINT8_MAX__ 255 2025-05-07T19:46:51.1258229Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:46:51.1258332Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:46:51.1258422Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:46:51.1258508Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:46:51.1258606Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:46:51.1258691Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:46:51.1258793Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.1258894Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:46:51.1258990Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:46:51.1259075Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:46:51.1259161Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:46:51.1259257Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:46:51.1259341Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:46:51.1259443Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.1259559Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:46:51.1259646Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:46:51.1259736Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:46:51.1259824Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:46:51.1259923Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:46:51.1260012Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:46:51.1260099Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:46:51.1260212Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:46:51.1260407Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:46:51.1260498Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:46:51.1260589Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:46:51.1260864Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:46:51.1260965Z #define __UINT_FAST32_MAX__ 
4294967295U 2025-05-07T19:46:51.1261070Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:46:51.1261174Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:46:51.1261267Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:46:51.1261359Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:46:51.1261451Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:46:51.1261581Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:46:51.1261697Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:46:51.1261797Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:46:51.1261898Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:46:51.1261991Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:46:51.1262083Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:46:51.1262178Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:46:51.1262294Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:46:51.1262395Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:46:51.1262490Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:46:51.1262595Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:46:51.1262691Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:46:51.1262785Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:46:51.1262898Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:46:51.1263004Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:46:51.1263098Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:46:51.1263191Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:46:51.1263294Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:46:51.1263483Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:46:51.1263592Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:46:51.1263700Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:46:51.1263797Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:46:51.1263894Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:46:51.1263989Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:46:51.1264177Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 
2025-05-07T19:46:51.1264302Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:46:51.1264400Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:46:51.1264503Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:46:51.1264599Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:46:51.1264694Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:46:51.1264789Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:46:51.1264906Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:46:51.1265004Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:46:51.1265119Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:46:51.1265234Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:46:51.1265334Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:46:51.1265432Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:46:51.1265517Z #define __USE_ANSI 1 2025-05-07T19:46:51.1265611Z #define __USE_ATFILE 1 2025-05-07T19:46:51.1265692Z #define __USE_BSD 1 2025-05-07T19:46:51.1265792Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:46:51.1265885Z #define __USE_GNU 1 2025-05-07T19:46:51.1265971Z #define __USE_ISOC11 1 2025-05-07T19:46:51.1266057Z #define __USE_ISOC95 1 2025-05-07T19:46:51.1266143Z #define __USE_ISOC99 1 2025-05-07T19:46:51.1266240Z #define __USE_ISOCXX11 1 2025-05-07T19:46:51.1266331Z #define __USE_LARGEFILE 1 2025-05-07T19:46:51.1266431Z #define __USE_LARGEFILE64 1 2025-05-07T19:46:51.1266525Z #define __USE_MISC 1 2025-05-07T19:46:51.1266611Z #define __USE_POSIX 1 2025-05-07T19:46:51.1266705Z #define __USE_POSIX199309 1 2025-05-07T19:46:51.1266800Z #define __USE_POSIX199506 1 2025-05-07T19:46:51.1266894Z #define __USE_POSIX2 1 2025-05-07T19:46:51.1266978Z #define __USE_SVID 1 2025-05-07T19:46:51.1267058Z #define __USE_UNIX98 1 2025-05-07T19:46:51.1267152Z #define __USE_XOPEN 1 2025-05-07T19:46:51.1267238Z #define __USE_XOPEN2K 1 2025-05-07T19:46:51.1267328Z #define __USE_XOPEN2K8 1 2025-05-07T19:46:51.1267423Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:46:51.1267529Z #define 
__USE_XOPEN2KXSI 1 2025-05-07T19:46:51.1267624Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:46:51.1267728Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:46:51.1267839Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:46:51.1267941Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:46:51.1268040Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:46:51.1268137Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:46:51.1268241Z #define __VECTOR_TYPES_H__ 2025-05-07T19:46:51.1268688Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:51.1268817Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:46:51.1268922Z #define __WAIT_STATUS void * 2025-05-07T19:46:51.1269023Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:46:51.1269124Z #define __WALL 0x40000000 2025-05-07T19:46:51.1269216Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:46:51.1269307Z #define __WCHAR_TYPE__ int 2025-05-07T19:46:51.1269403Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:46:51.1269506Z #define __WCLONE 0x80000000 2025-05-07T19:46:51.1269641Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:46:51.1269730Z #define __WCOREFLAG 0x80 2025-05-07T19:46:51.1269893Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:46:51.1270051Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:46:51.1270187Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:46:51.1270423Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:46:51.1270630Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:46:51.1270725Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:46:51.1270820Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:46:51.1270930Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:46:51.1271020Z #define __WINT_WIDTH__ 32 2025-05-07T19:46:51.1271111Z #define __WNOTHREAD 0x20000000 
2025-05-07T19:46:51.1271258Z #define __WORDSIZE 64 2025-05-07T19:46:51.1271360Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:46:51.1271497Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:46:51.1271610Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:46:51.1271713Z #define __W_CONTINUED 0xffff 2025-05-07T19:46:51.1271840Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:46:51.1271953Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:46:51.1272049Z #define ____FILE_defined 1 2025-05-07T19:46:51.1272141Z #define ____mbstate_t_defined 1 2025-05-07T19:46:51.1272264Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:46:51.1272458Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:46:51.1272549Z #define __amd64 1 2025-05-07T19:46:51.1272630Z #define __amd64__ 1 2025-05-07T19:46:51.1272738Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:46:51.1272845Z #define __attribute_artificial__ 2025-05-07T19:46:51.1273105Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:46:51.1273280Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:46:51.1273485Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:46:51.1273724Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:46:51.1273863Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:46:51.1274016Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:46:51.1274148Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:46:51.1274276Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:46:51.1274490Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:46:51.1274582Z #define __blkcnt_t_defined 2025-05-07T19:46:51.1274670Z #define __blksize_t_defined 
2025-05-07T19:46:51.1274847Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:46:51.1274983Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:46:51.1275060Z #define __bounded 2025-05-07T19:46:51.1275638Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:46:51.1276564Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:51.1277351Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:46:51.1277620Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:46:51.1277977Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:46:51.1278951Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:46:51.1279061Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:46:51.1279302Z #define __catch(X) catch(X) 2025-05-07T19:46:51.1279390Z #define __cdecl 2025-05-07T19:46:51.1279480Z #define __clang__ 1 2025-05-07T19:46:51.1279593Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:46:51.1279707Z #define __clang_major__ 16 
2025-05-07T19:46:51.1279802Z #define __clang_minor__ 0 2025-05-07T19:46:51.1279903Z #define __clang_patchlevel__ 6 2025-05-07T19:46:51.1280419Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:46:51.1280550Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:46:51.1280645Z #define __clock_t_defined 1 2025-05-07T19:46:51.1280752Z #define __clockid_t_defined 1 2025-05-07T19:46:51.1280950Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:46:51.1281049Z #define __code_model_small__ 1 2025-05-07T19:46:51.1281162Z #define __constant__ __location__(constant) 2025-05-07T19:46:51.1281273Z #define __cplusplus 201703L 2025-05-07T19:46:51.1281384Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:46:51.1281492Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:46:51.1281607Z #define __cpp_alias_templates 200704L 2025-05-07T19:46:51.1281707Z #define __cpp_aligned_new 201606L 2025-05-07T19:46:51.1281808Z #define __cpp_attributes 200809L 2025-05-07T19:46:51.1281911Z #define __cpp_binary_literals 201304L 2025-05-07T19:46:51.1282032Z #define __cpp_capture_star_this 201603L 2025-05-07T19:46:51.1282138Z #define __cpp_constexpr 201603L 2025-05-07T19:46:51.1282254Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:46:51.1282361Z #define __cpp_decltype 200707L 2025-05-07T19:46:51.1282459Z #define __cpp_decltype_auto 201304L 2025-05-07T19:46:51.1282566Z #define __cpp_deduction_guides 201703L 2025-05-07T19:46:51.1282687Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:46:51.1282797Z #define __cpp_digit_separators 201309L 2025-05-07T19:46:51.1282910Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:46:51.1283014Z #define __cpp_exceptions 199711L 2025-05-07T19:46:51.1283128Z #define __cpp_fold_expressions 201603L 2025-05-07T19:46:51.1283227Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:46:51.1283347Z #define 
__cpp_guaranteed_copy_elision 201606L 2025-05-07T19:46:51.1283453Z #define __cpp_hex_float 201603L 2025-05-07T19:46:51.1283553Z #define __cpp_if_constexpr 201606L 2025-05-07T19:46:51.1283672Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:46:51.1283789Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:46:51.1283899Z #define __cpp_init_captures 201304L 2025-05-07T19:46:51.1284005Z #define __cpp_initializer_lists 200806L 2025-05-07T19:46:51.1284108Z #define __cpp_inline_variables 201606L 2025-05-07T19:46:51.1284212Z #define __cpp_lambdas 200907L 2025-05-07T19:46:51.1284325Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:46:51.1284430Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:46:51.1284526Z #define __cpp_lib_as_const 201510 2025-05-07T19:46:51.1284643Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:46:51.1284756Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:46:51.1284916Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:46:51.1285029Z #define __cpp_lib_hypot 201603 2025-05-07T19:46:51.1285133Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:46:51.1285264Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:46:51.1285370Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:46:51.1285483Z #define __cpp_lib_is_final 201402L 2025-05-07T19:46:51.1285583Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:46:51.1285688Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:46:51.1285805Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:46:51.1285901Z #define __cpp_lib_launder 201606 2025-05-07T19:46:51.1286000Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:46:51.1286132Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:46:51.1286261Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:46:51.1286420Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:46:51.1286557Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 
2025-05-07T19:46:51.1286712Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:46:51.1286818Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:46:51.1286918Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:46:51.1287127Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:46:51.1287227Z #define __cpp_lib_void_t 201411 2025-05-07T19:46:51.1287346Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:46:51.1287454Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:46:51.1287601Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:46:51.1287720Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:46:51.1287829Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:46:51.1287982Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:46:51.1288073Z #define __cpp_nsdmi 200809L 2025-05-07T19:46:51.1288299Z #define __cpp_range_based_for 201603L 2025-05-07T19:46:51.1288409Z #define __cpp_raw_strings 200710L 2025-05-07T19:46:51.1288507Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:46:51.1288726Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:46:51.1288813Z #define __cpp_rtti 199711L 2025-05-07T19:46:51.1288918Z #define __cpp_rvalue_references 200610L 2025-05-07T19:46:51.1289013Z #define __cpp_static_assert 201411L 2025-05-07T19:46:51.1289118Z #define __cpp_static_call_operator 202207L 2025-05-07T19:46:51.1289229Z #define __cpp_structured_bindings 201606L 2025-05-07T19:46:51.1289318Z #define __cpp_template_auto 201606L 2025-05-07T19:46:51.1289427Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:46:51.1289527Z #define __cpp_unicode_characters 200704L 2025-05-07T19:46:51.1289634Z #define __cpp_unicode_literals 200710L 2025-05-07T19:46:51.1289742Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:46:51.1289841Z #define __cpp_variable_templates 201304L 2025-05-07T19:46:51.1289952Z #define __cpp_variadic_templates 200704L 
2025-05-07T19:46:51.1290045Z #define __cpp_variadic_using 201611L 2025-05-07T19:46:51.1290150Z #define __cudaCDP2DeviceGetAttribute 2025-05-07T19:46:51.1290254Z #define __cudaCDP2DeviceGetCacheConfig 2025-05-07T19:46:51.1290356Z #define __cudaCDP2DeviceGetLimit 2025-05-07T19:46:51.1290469Z #define __cudaCDP2DeviceGetSharedMemConfig 2025-05-07T19:46:51.1290578Z #define __cudaCDP2EventCreateWithFlags 2025-05-07T19:46:51.1290682Z #define __cudaCDP2EventDestroy 2025-05-07T19:46:51.1290775Z #define __cudaCDP2EventRecord 2025-05-07T19:46:51.1290878Z #define __cudaCDP2EventRecordWithFlags 2025-05-07T19:46:51.1290996Z #define __cudaCDP2EventRecordWithFlags_ptsz 2025-05-07T19:46:51.1291102Z #define __cudaCDP2EventRecord_ptsz 2025-05-07T19:46:51.1291184Z #define __cudaCDP2Free 2025-05-07T19:46:51.1291280Z #define __cudaCDP2FuncGetAttributes 2025-05-07T19:46:51.1291377Z #define __cudaCDP2GetDevice 2025-05-07T19:46:51.1291473Z #define __cudaCDP2GetDeviceCount 2025-05-07T19:46:51.1291563Z #define __cudaCDP2GetErrorName 2025-05-07T19:46:51.1291656Z #define __cudaCDP2GetErrorString 2025-05-07T19:46:51.1291750Z #define __cudaCDP2GetLastError 2025-05-07T19:46:51.1291851Z #define __cudaCDP2GetParameterBuffer 2025-05-07T19:46:51.1291954Z #define __cudaCDP2GetParameterBufferV2 2025-05-07T19:46:51.1292055Z #define __cudaCDP2LaunchDevice 2025-05-07T19:46:51.1292152Z #define __cudaCDP2LaunchDeviceV2 2025-05-07T19:46:51.1292253Z #define __cudaCDP2LaunchDeviceV2_ptsz 2025-05-07T19:46:51.1292358Z #define __cudaCDP2LaunchDevice_ptsz 2025-05-07T19:46:51.1292444Z #define __cudaCDP2Malloc 2025-05-07T19:46:51.1292533Z #define __cudaCDP2Memcpy2DAsync 2025-05-07T19:46:51.1292631Z #define __cudaCDP2Memcpy2DAsync_ptsz 2025-05-07T19:46:51.1292737Z #define __cudaCDP2Memcpy3DAsync 2025-05-07T19:46:51.1292838Z #define __cudaCDP2Memcpy3DAsync_ptsz 2025-05-07T19:46:51.1292931Z #define __cudaCDP2MemcpyAsync 2025-05-07T19:46:51.1293038Z #define __cudaCDP2MemcpyAsync_ptsz 2025-05-07T19:46:51.1293203Z #define 
__cudaCDP2Memset2DAsync 2025-05-07T19:46:51.1293298Z #define __cudaCDP2Memset2DAsync_ptsz 2025-05-07T19:46:51.1293388Z #define __cudaCDP2Memset3DAsync 2025-05-07T19:46:51.1293489Z #define __cudaCDP2Memset3DAsync_ptsz 2025-05-07T19:46:51.1293578Z #define __cudaCDP2MemsetAsync 2025-05-07T19:46:51.1293676Z #define __cudaCDP2MemsetAsync_ptsz 2025-05-07T19:46:51.1293925Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessor 2025-05-07T19:46:51.1294152Z #define __cudaCDP2OccupancyMaxActiveBlocksPerMultiprocessorWithFlags 2025-05-07T19:46:51.1294250Z #define __cudaCDP2PeekAtLastError 2025-05-07T19:46:51.1294347Z #define __cudaCDP2RuntimeGetVersion 2025-05-07T19:46:51.1294465Z #define __cudaCDP2StreamCreateWithFlags 2025-05-07T19:46:51.1294559Z #define __cudaCDP2StreamDestroy 2025-05-07T19:46:51.1294651Z #define __cudaCDP2StreamWaitEvent 2025-05-07T19:46:51.1294761Z #define __cudaCDP2StreamWaitEvent_ptsz 2025-05-07T19:46:51.1294855Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:46:51.1294948Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:46:51.1295037Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:46:51.1295142Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:46:51.1295239Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:46:51.1295374Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:46:51.1295470Z #define __daddr_t_defined 2025-05-07T19:46:51.1295553Z #define __dev_t_defined 2025-05-07T19:46:51.1295645Z #define __device__ __location__(device) 2025-05-07T19:46:51.1295777Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:46:51.1296008Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:46:51.1296229Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:46:51.1296361Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:46:51.1296500Z #define __exctype(name) extern int name (int) __THROW 
2025-05-07T19:46:51.1296678Z #define __exctype_l(name) extern int name (int, __locale_t) __THROW 2025-05-07T19:46:51.1296760Z #define __export__ 2025-05-07T19:46:51.1297002Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:51.1297193Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:46:51.1297273Z #define __flexarr [] 2025-05-07T19:46:51.1297459Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:46:51.1297654Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:46:51.1297745Z #define __fsblkcnt_t_defined 2025-05-07T19:46:51.1297835Z #define __fsfilcnt_t_defined 2025-05-07T19:46:51.1297924Z #define __gid_t_defined 2025-05-07T19:46:51.1298068Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:46:51.1298212Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:46:51.1298440Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:46:51.1298548Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:46:51.1298658Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:46:51.1298774Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:46:51.1298898Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:46:51.1299237Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:46:51.1299423Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:46:51.1299590Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:46:51.1299689Z #define __glibcxx_function_requires(...) 2025-05-07T19:46:51.1299788Z #define __glibcxx_integral_traps true 2025-05-07T19:46:51.1300090Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? 
(((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:46:51.1300480Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:46:51.1300843Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:46:51.1301004Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:46:51.1301207Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:46:51.1301377Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:46:51.1301502Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:46:51.1301671Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.1301811Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:46:51.1301953Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:46:51.1302145Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.1302327Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:46:51.1302480Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:46:51.1302599Z #define __glibcxx_requires_nonempty() 2025-05-07T19:46:51.1302784Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:46:51.1303011Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:51.1303195Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:46:51.1303439Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:46:51.1303570Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:46:51.1303728Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:46:51.1303909Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:46:51.1304119Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 
2025-05-07T19:46:51.1304234Z #define __glibcxx_requires_string(_String) 2025-05-07T19:46:51.1304379Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:46:51.1304496Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:46:51.1304637Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:46:51.1304751Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:46:51.1304863Z #define __global__ __location__(global) 2025-05-07T19:46:51.1304951Z #define __gnu_linux__ 1 2025-05-07T19:46:51.1305092Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:46:51.1305198Z #define __have_pthread_attr_t 1 2025-05-07T19:46:51.1305296Z #define __host__ __location__(host) 2025-05-07T19:46:51.1305385Z #define __id_t_defined 2025-05-07T19:46:51.1305469Z #define __import__ 2025-05-07T19:46:51.1305621Z #define __inline_hint__ __attribute__((nv_inline_hint)) 2025-05-07T19:46:51.1305713Z #define __ino64_t_defined 2025-05-07T19:46:51.1305802Z #define __ino_t_defined 2025-05-07T19:46:51.1305901Z #define __int8_t_defined 2025-05-07T19:46:51.1306127Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:51.1306282Z #define __isalnum_l(c,l) __isctype_l((c), _ISalnum, (l)) 2025-05-07T19:46:51.1306437Z #define __isalpha_l(c,l) __isctype_l((c), _ISalpha, (l)) 2025-05-07T19:46:51.1306539Z #define __isascii(c) (((c) & ~0x7f) == 0) 2025-05-07T19:46:51.1306653Z #define __isascii_l(c,l) ((l), __isascii (c)) 2025-05-07T19:46:51.1306799Z #define __isblank_l(c,l) __isctype_l((c), _ISblank, (l)) 2025-05-07T19:46:51.1306958Z #define __iscntrl_l(c,l) __isctype_l((c), _IScntrl, (l)) 2025-05-07T19:46:51.1307232Z #define __isctype_l(c,type,locale) ((locale)->__ctype_b[(int) (c)] & (unsigned short int) type) 2025-05-07T19:46:51.1307376Z #define __isdigit_l(c,l) __isctype_l((c), _ISdigit, (l)) 2025-05-07T19:46:51.1307535Z #define __isgraph_l(c,l) __isctype_l((c), _ISgraph, (l)) 2025-05-07T19:46:51.1307734Z #define 
__isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:46:51.1307877Z #define __islower_l(c,l) __isctype_l((c), _ISlower, (l)) 2025-05-07T19:46:51.1308274Z #define __isprint_l(c,l) __isctype_l((c), _ISprint, (l)) 2025-05-07T19:46:51.1308415Z #define __ispunct_l(c,l) __isctype_l((c), _ISpunct, (l)) 2025-05-07T19:46:51.1308557Z #define __isspace_l(c,l) __isctype_l((c), _ISspace, (l)) 2025-05-07T19:46:51.1308704Z #define __isupper_l(c,l) __isctype_l((c), _ISupper, (l)) 2025-05-07T19:46:51.1308867Z #define __isxdigit_l(c,l) __isctype_l((c), _ISxdigit, (l)) 2025-05-07T19:46:51.1309007Z #define __k8 1 2025-05-07T19:46:51.1309087Z #define __k8__ 1 2025-05-07T19:46:51.1309187Z #define __key_t_defined 2025-05-07T19:46:51.1309382Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:46:51.1309474Z #define __ldiv_t_defined 1 2025-05-07T19:46:51.1309553Z #define __linux 1 2025-05-07T19:46:51.1309649Z #define __linux__ 1 2025-05-07T19:46:51.1309739Z #define __lldiv_t_defined 1 2025-05-07T19:46:51.1309818Z #define __llvm__ 1 2025-05-07T19:46:51.1309926Z #define __location__(a) __annotate__(a) 2025-05-07T19:46:51.1310022Z #define __long_double_t long double 2025-05-07T19:46:51.1310123Z #define __malloc_and_calloc_defined 2025-05-07T19:46:51.1310227Z #define __managed__ __location__(managed) 2025-05-07T19:46:51.1310363Z #define __maxnreg__(a) __attribute__((maxnreg(a))) 2025-05-07T19:46:51.1310451Z #define __mode_t_defined 2025-05-07T19:46:51.1310534Z #define __need_IOV_MAX 2025-05-07T19:46:51.1310638Z #define __need_clockid_t 2025-05-07T19:46:51.1310729Z #define __nlink_t_defined 2025-05-07T19:46:51.1310847Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:46:51.1310982Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:46:51.1311151Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:46:51.1311256Z #define __nv_pure__ __location__(nv_pure) 
2025-05-07T19:46:51.1311357Z #define __off64_t_defined 2025-05-07T19:46:51.1311444Z #define __off_t_defined 2025-05-07T19:46:51.1311522Z #define __pic__ 2 2025-05-07T19:46:51.1311609Z #define __pid_t_defined 2025-05-07T19:46:51.1311700Z #define __pie__ 2 2025-05-07T19:46:51.1311801Z #define __private_extern__ extern 2025-05-07T19:46:51.1311885Z #define __ptr_t void * 2025-05-07T19:46:51.1311974Z #define __ptrvalue 2025-05-07T19:46:51.1312064Z #define __restrict_arr 2025-05-07T19:46:51.1312200Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:46:51.1312327Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:46:51.1312445Z #define __shared__ __location__(shared) 2025-05-07T19:46:51.1312537Z #define __sigset_t_defined 2025-05-07T19:46:51.1312639Z #define __specialization_static 2025-05-07T19:46:51.1312739Z #define __ssize_t_defined 2025-05-07T19:46:51.1312826Z #define __stub_bdflush 2025-05-07T19:46:51.1312910Z #define __stub_chflags 2025-05-07T19:46:51.1313105Z #define __stub_fattach 2025-05-07T19:46:51.1313198Z #define __stub_fchflags 2025-05-07T19:46:51.1313277Z #define __stub_fdetach 2025-05-07T19:46:51.1313354Z #define __stub_getmsg 2025-05-07T19:46:51.1313444Z #define __stub_gtty 2025-05-07T19:46:51.1313528Z #define __stub_lchmod 2025-05-07T19:46:51.1313608Z #define __stub_putmsg 2025-05-07T19:46:51.1313685Z #define __stub_revoke 2025-05-07T19:46:51.1313775Z #define __stub_setlogin 2025-05-07T19:46:51.1313859Z #define __stub_sigreturn 2025-05-07T19:46:51.1313935Z #define __stub_sstk 2025-05-07T19:46:51.1314019Z #define __stub_stty 2025-05-07T19:46:51.1314106Z #define __suseconds_t_defined 2025-05-07T19:46:51.1314196Z #define __thread__ __thread 2025-05-07T19:46:51.1314289Z #define __throw_exception_again throw 2025-05-07T19:46:51.1314381Z #define __time_t_defined 1 2025-05-07T19:46:51.1314461Z #define __timer_t_defined 1 2025-05-07T19:46:51.1314546Z #define __timespec_defined 1 2025-05-07T19:46:51.1314641Z #define __toascii(c) 
((c) & 0x7f) 2025-05-07T19:46:51.1314743Z #define __toascii_l(c,l) ((l), __toascii (c)) 2025-05-07T19:46:51.1315262Z #define __tobody(c,f,a,args) (__extension__ ({ int __res; if (sizeof (c) > 1) { if (__builtin_constant_p (c)) { int __c = (c); __res = __c < -128 || __c > 255 ? __c : (a)[__c]; } else __res = f args; } else __res = (a)[(int) (c)]; __res; })) 2025-05-07T19:46:51.1315398Z #define __try try 2025-05-07T19:46:51.1315476Z #define __tune_k8__ 1 2025-05-07T19:46:51.1315556Z #define __u_char_defined 2025-05-07T19:46:51.1315805Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:46:51.1315894Z #define __uid_t_defined 2025-05-07T19:46:51.1316029Z #define __unbounded 2025-05-07T19:46:51.1316102Z #define __unix 1 2025-05-07T19:46:51.1316186Z #define __unix__ 1 2025-05-07T19:46:51.1316274Z #define __useconds_t_defined 2025-05-07T19:46:51.1316351Z #define __warnattr(msg) 2025-05-07T19:46:51.1316474Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:46:51.1316558Z #define __wur 2025-05-07T19:46:51.1316631Z #define __x86_64 1 2025-05-07T19:46:51.1316706Z #define __x86_64__ 1 2025-05-07T19:46:51.1316877Z #define _tolower(c) ((int) (*__ctype_tolower_loc ())[(int) (c)]) 2025-05-07T19:46:51.1317034Z #define _toupper(c) ((int) (*__ctype_toupper_loc ())[(int) (c)]) 2025-05-07T19:46:51.1317144Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:46:51.1317472Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:51.1317863Z #define assert_perror(errnum) (!(errnum) ? 
__ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:46:51.1317958Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:46:51.1318048Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:46:51.1318149Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:46:51.1318252Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:46:51.1318341Z #define cudaArrayCubemap 0x04 2025-05-07T19:46:51.1318442Z #define cudaArrayDefault 0x00 2025-05-07T19:46:51.1318548Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:46:51.1318635Z #define cudaArrayLayered 0x01 2025-05-07T19:46:51.1318723Z #define cudaArraySparse 0x40 2025-05-07T19:46:51.1318872Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:46:51.1318974Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:46:51.1319075Z #define cudaArrayTextureGather 0x08 2025-05-07T19:46:51.1319248Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:46:51.1319406Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:46:51.1319507Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:46:51.1319605Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:46:51.1319715Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:46:51.1319807Z #define cudaDeviceMapHost 0x08 2025-05-07T19:46:51.1319897Z #define cudaDeviceMask 0xff 2025-05-07T19:46:51.1320002Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:46:51.1320114Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:46:51.1320207Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:46:51.1320302Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:46:51.1320402Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:46:51.1320498Z #define cudaDeviceSyncMemops 0x80 2025-05-07T19:46:51.1320590Z #define cudaEventBlockingSync 0x01 2025-05-07T19:46:51.1320690Z #define cudaEventDefault 0x00 2025-05-07T19:46:51.1320782Z #define cudaEventDisableTiming 0x02 2025-05-07T19:46:51.1320874Z #define cudaEventInterprocess 0x04 
2025-05-07T19:46:51.1320966Z #define cudaEventRecordDefault 0x00 2025-05-07T19:46:51.1321073Z #define cudaEventRecordExternal 0x01 2025-05-07T19:46:51.1321163Z #define cudaEventWaitDefault 0x00 2025-05-07T19:46:51.1321256Z #define cudaEventWaitExternal 0x01 2025-05-07T19:46:51.1321375Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:46:51.1321554Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:46:51.1321721Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:46:51.1321895Z #define cudaGetDeviceProperties cudaGetDeviceProperties_v2 2025-05-07T19:46:51.1322005Z #define cudaGraphKernelNodePortDefault 0 2025-05-07T19:46:51.1322144Z #define cudaGraphKernelNodePortLaunchCompletion 2 2025-05-07T19:46:51.1322316Z #define cudaGraphKernelNodePortProgrammatic 1 2025-05-07T19:46:51.1322428Z #define cudaHostAllocDefault 0x00 2025-05-07T19:46:51.1322523Z #define cudaHostAllocMapped 0x02 2025-05-07T19:46:51.1322622Z #define cudaHostAllocPortable 0x01 2025-05-07T19:46:51.1322736Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:46:51.1322836Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:46:51.1323033Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:46:51.1323135Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:46:51.1323247Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:46:51.1323348Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:46:51.1323454Z #define cudaInitDeviceFlagsAreValid 0x01 2025-05-07T19:46:51.1323559Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:46:51.1323675Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:46:51.1323808Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:46:51.1323977Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:51.1324282Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:51.1324566Z #define cudaKernelNodeAttributeClusterDimension 
cudaLaunchAttributeClusterDimension 2025-05-07T19:46:51.1325030Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:46:51.1325283Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:46:51.1325662Z #define cudaKernelNodeAttributeDeviceUpdatableKernelNode cudaLaunchAttributeDeviceUpdatableKernelNode 2025-05-07T19:46:51.1325918Z #define cudaKernelNodeAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:51.1326208Z #define cudaKernelNodeAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 2025-05-07T19:46:51.1326626Z #define cudaKernelNodeAttributePreferredSharedMemoryCarveout cudaLaunchAttributePreferredSharedMemoryCarveout 2025-05-07T19:46:51.1326843Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:51.1326947Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:46:51.1327039Z #define cudaMemAttachHost 0x02 2025-05-07T19:46:51.1327132Z #define cudaMemAttachSingle 0x04 2025-05-07T19:46:51.1327234Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:46:51.1327337Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:46:51.1327433Z #define cudaOccupancyDefault 0x00 2025-05-07T19:46:51.1327567Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:46:51.1327678Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:46:51.1328002Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:46:51.1328122Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:46:51.1328272Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:46:51.1328550Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:46:51.1328784Z #define cudaStreamAttributeMemSyncDomain cudaLaunchAttributeMemSyncDomain 2025-05-07T19:46:51.1329041Z #define cudaStreamAttributeMemSyncDomainMap cudaLaunchAttributeMemSyncDomainMap 
2025-05-07T19:46:51.1329240Z #define cudaStreamAttributePriority cudaLaunchAttributePriority 2025-05-07T19:46:51.1329547Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:46:51.1329645Z #define cudaStreamDefault 0x00 2025-05-07T19:46:51.1329781Z #define cudaStreamFireAndForget ((cudaStream_t)0x4) 2025-05-07T19:46:51.1330023Z #define cudaStreamGetCaptureInfo __CUDART_API_PTSZ(cudaStreamGetCaptureInfo_v2) 2025-05-07T19:46:51.1330221Z #define cudaStreamGraphFireAndForget (cudaStream_t)0x0200000000000000 2025-05-07T19:46:51.1330474Z #define cudaStreamGraphFireAndForgetAsSibling (cudaStream_t)0x0300000000000000 2025-05-07T19:46:51.1330657Z #define cudaStreamGraphTailLaunch (cudaStream_t)0x0100000000000000 2025-05-07T19:46:51.1330765Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:46:51.1331191Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:46:51.1331319Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:46:51.1331437Z #define cudaStreamTailLaunch ((cudaStream_t)0x3) 2025-05-07T19:46:51.1331529Z #define cudaSurfaceType1D 0x01 2025-05-07T19:46:51.1331641Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:46:51.1331733Z #define cudaSurfaceType2D 0x02 2025-05-07T19:46:51.1331882Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:46:51.1331980Z #define cudaSurfaceType3D 0x03 2025-05-07T19:46:51.1332077Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:46:51.1332188Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:46:51.1332274Z #define cudaTextureType1D 0x01 2025-05-07T19:46:51.1332381Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:46:51.1332469Z #define cudaTextureType2D 0x02 2025-05-07T19:46:51.1332564Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:46:51.1332659Z #define cudaTextureType3D 0x03 2025-05-07T19:46:51.1332752Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:46:51.1332865Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:46:51.1333169Z #define 
cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:46:51.1333264Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:46:51.1333360Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:46:51.1333446Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:46:51.1333542Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:46:51.1333619Z #define htole16(x) (x) 2025-05-07T19:46:51.1333698Z #define htole32(x) (x) 2025-05-07T19:46:51.1333780Z #define htole64(x) (x) 2025-05-07T19:46:51.1333896Z #define isalnum_l(c,l) __isalnum_l ((c), (l)) 2025-05-07T19:46:51.1334002Z #define isalpha_l(c,l) __isalpha_l ((c), (l)) 2025-05-07T19:46:51.1334091Z #define isascii(c) __isascii (c) 2025-05-07T19:46:51.1334199Z #define isascii_l(c,l) __isascii_l ((c), (l)) 2025-05-07T19:46:51.1334300Z #define isblank_l(c,l) __isblank_l ((c), (l)) 2025-05-07T19:46:51.1334398Z #define iscntrl_l(c,l) __iscntrl_l ((c), (l)) 2025-05-07T19:46:51.1334507Z #define isdigit_l(c,l) __isdigit_l ((c), (l)) 2025-05-07T19:46:51.1334608Z #define isgraph_l(c,l) __isgraph_l ((c), (l)) 2025-05-07T19:46:51.1334706Z #define islower_l(c,l) __islower_l ((c), (l)) 2025-05-07T19:46:51.1334808Z #define isprint_l(c,l) __isprint_l ((c), (l)) 2025-05-07T19:46:51.1334917Z #define ispunct_l(c,l) __ispunct_l ((c), (l)) 2025-05-07T19:46:51.1335021Z #define isspace_l(c,l) __isspace_l ((c), (l)) 2025-05-07T19:46:51.1335123Z #define isupper_l(c,l) __isupper_l ((c), (l)) 2025-05-07T19:46:51.1335239Z #define isxdigit_l(c,l) __isxdigit_l ((c), (l)) 2025-05-07T19:46:51.1335318Z #define le16toh(x) (x) 2025-05-07T19:46:51.1335404Z #define le32toh(x) (x) 2025-05-07T19:46:51.1335489Z #define le64toh(x) (x) 2025-05-07T19:46:51.1335585Z #define linux 1 2025-05-07T19:46:51.1335686Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:46:51.1335815Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:46:51.1335972Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:46:51.1336075Z 
#define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:46:51.1336190Z #define offsetof(t,d) __builtin_offsetof(t, d) 2025-05-07T19:46:51.1336296Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:46:51.1336394Z #define stderr stderr 2025-05-07T19:46:51.1336477Z #define stdin stdin 2025-05-07T19:46:51.1336562Z #define stdout stdout 2025-05-07T19:46:51.1337050Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:51.1337570Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:46:51.1337668Z #define toascii(c) __toascii (c) 2025-05-07T19:46:51.1337794Z #define toascii_l(c,l) __toascii_l ((c), (l)) 2025-05-07T19:46:51.1337926Z #define unix 1 2025-05-07T19:46:51.1338053Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:46:51.1338190Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:46:51.1338304Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:46:51.1338415Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:46:51.1338533Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:46:51.1338581Z 2025-05-07T19:46:51.1447947Z 2025-05-07T19:46:52.7605644Z + conda run -n build_binary nvcc --version 2025-05-07T19:46:52.7605958Z 2025-05-07T19:46:52.7606096Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:46:52.7606447Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:46:52.7611330Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:46:52.7611794Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:46:52.7612195Z Build cuda_12.6.r12.6/compiler.35059454_0 2025-05-07T19:46:52.7612424Z 2025-05-07T19:46:52.8373943Z 2025-05-07T19:46:52.8383758Z which: no nvidia-smi in 
(CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:46:52.8384549Z [CHECK] nvidia-smi not found 2025-05-07T19:46:52.8384863Z [INSTALL] Successfully installed CUDA 12.6.3 2025-05-07T19:46:52.8479329Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:52.8479992Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/12.6.3 2025-05-07T19:46:52.8480674Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:46:52.8481037Z env: 2025-05-07T19:46:52.8481310Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:46:52.8481639Z BUILD_ENV: build_binary 2025-05-07T19:46:52.8481935Z BUILD_TARGET: default 2025-05-07T19:46:52.8482188Z BUILD_VARIANT: cuda 2025-05-07T19:46:52.8482473Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:46:52.8482750Z ##[endgroup] 2025-05-07T19:46:53.2732257Z ################################################################################ 2025-05-07T19:46:53.2733567Z # Install PyTorch (PIP) 2025-05-07T19:46:53.2733997Z # 2025-05-07T19:46:53.2743623Z # [2025-05-07T19:46:53.274Z] + install_pytorch_pip build_binary nightly cuda/12.6.3 2025-05-07T19:46:53.2744270Z ################################################################################ 2025-05-07T19:46:53.2744518Z 2025-05-07T19:46:53.2771268Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:46:54.1874245Z Channels: 2025-05-07T19:46:54.1874513Z - conda-forge 2025-05-07T19:46:54.1874784Z Platform: linux-64 2025-05-07T19:46:57.2608518Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:46:58.9519984Z Solving environment: \ | / - done 2025-05-07T19:46:59.2584268Z 2025-05-07T19:46:59.2585272Z ## Package Plan ## 2025-05-07T19:46:59.2585501Z 2025-05-07T19:46:59.2585751Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:46:59.2586088Z 2025-05-07T19:46:59.2586237Z added / 
updated specs: 2025-05-07T19:46:59.2586505Z - numpy 2025-05-07T19:46:59.2586628Z 2025-05-07T19:46:59.2586634Z 2025-05-07T19:46:59.2586756Z The following packages will be downloaded: 2025-05-07T19:46:59.2586998Z 2025-05-07T19:46:59.2587149Z package | build 2025-05-07T19:46:59.2587498Z ---------------------------|----------------- 2025-05-07T19:46:59.2587894Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:46:59.2588404Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:46:59.2588879Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:46:59.2589347Z numpy-2.0.2 | py39h9cb892a_1 7.6 MB conda-forge 2025-05-07T19:46:59.2589750Z ------------------------------------------------------------ 2025-05-07T19:46:59.2590118Z Total: 7.6 MB 2025-05-07T19:46:59.2590674Z 2025-05-07T19:46:59.2590823Z The following NEW packages will be INSTALLED: 2025-05-07T19:46:59.2591053Z 2025-05-07T19:46:59.2591288Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:46:59.2591839Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:46:59.2592377Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:46:59.2592889Z numpy conda-forge/linux-64::numpy-2.0.2-py39h9cb892a_1 2025-05-07T19:46:59.2593168Z 2025-05-07T19:46:59.2593172Z 2025-05-07T19:46:59.2593190Z 2025-05-07T19:46:59.2593337Z Downloading and Extracting Packages: ...working... 
2025-05-07T19:46:59.5025655Z liblapack-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:59.5464267Z libcblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:59.6102329Z libblas-3.9.0 | 16 KB | ########## | 100% 2025-05-07T19:46:59.9599369Z numpy-2.0.2 | 7.6 MB | ########## | 100% 2025-05-07T19:46:59.9605975Z done
2025-05-07T19:47:00.0614035Z Preparing transaction: | done 2025-05-07T19:47:00.2625338Z Verifying transaction: - \ done 2025-05-07T19:47:00.3637895Z Executing transaction: / done 2025-05-07T19:47:00.4734359Z ################################################################################ 2025-05-07T19:47:00.4734828Z # Install Package From PyTorch PIP: torch 2025-05-07T19:47:00.4735155Z # 2025-05-07T19:47:00.4759524Z # [2025-05-07T19:47:00.475Z] + install_from_pytorch_pip build_binary torch nightly cuda/12.6.3 2025-05-07T19:47:00.4760071Z ################################################################################ 2025-05-07T19:47:00.4760595Z 2025-05-07T19:47:00.4779132Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:47:00.5708519Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:47:00.5708967Z ################################################################################ 2025-05-07T19:47:00.5709359Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:47:00.5709657Z # 2025-05-07T19:47:00.5729930Z # [2025-05-07T19:47:00.572Z] + __prepare_pip_arguments torch nightly cuda/12.6.3 2025-05-07T19:47:00.5731305Z ################################################################################ 2025-05-07T19:47:00.5732043Z 2025-05-07T19:47:00.5749531Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:47:00.5776540Z [INSTALL] Extracted package variant: cu126 2025-05-07T19:47:00.5788653Z [INSTALL] Using a non-RELEASE channel: nightly ... 
2025-05-07T19:47:00.5790336Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:47:00.5793718Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:47:00.5801999Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu126/ ... 2025-05-07T19:47:00.5825647Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:36.2922616Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:48:36.2927022Z 2025-05-07T19:48:36.2927264Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu126/ 2025-05-07T19:48:36.2927781Z Collecting torch 2025-05-07T19:48:36.2928521Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp39-cp39-manylinux_2_28_x86_64.whl.metadata (30 kB) 2025-05-07T19:48:36.2929316Z Collecting filelock (from torch) 2025-05-07T19:48:36.2929923Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:48:36.2930930Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from torch) (4.13.2) 2025-05-07T19:48:36.2931736Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:48:36.2932295Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:48:36.2933260Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 29.5 MB/s eta 0:00:00 2025-05-07T19:48:36.2933668Z Collecting networkx (from torch) 
2025-05-07T19:48:36.2934208Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.2.1-py3-none-any.whl (1.6 MB) 2025-05-07T19:48:36.2934937Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.6/1.6 MB 11.0 MB/s eta 0:00:00 2025-05-07T19:48:36.2935705Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from torch) (3.1.6) 2025-05-07T19:48:36.2936528Z Collecting fsspec (from torch) 2025-05-07T19:48:36.2937052Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:48:36.2937637Z Collecting nvidia-cuda-nvrtc-cu12==12.6.77 (from torch) 2025-05-07T19:48:36.2938387Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_nvrtc_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (23.7 MB) 2025-05-07T19:48:36.2939205Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.7/23.7 MB 56.2 MB/s eta 0:00:00 2025-05-07T19:48:36.2939659Z Collecting nvidia-cuda-runtime-cu12==12.6.77 (from torch) 2025-05-07T19:48:36.2940736Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (897 kB) 2025-05-07T19:48:36.2942130Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 897.7/897.7 kB 8.7 MB/s eta 0:00:00 2025-05-07T19:48:36.2942599Z Collecting nvidia-cuda-cupti-cu12==12.6.80 (from torch) 2025-05-07T19:48:36.2943392Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cuda_cupti_cu12-12.6.80-py3-none-manylinux2014_x86_64.whl (8.9 MB) 2025-05-07T19:48:36.2944271Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 8.9/8.9 MB 68.9 MB/s eta 0:00:00 2025-05-07T19:48:36.2944691Z Collecting nvidia-cudnn-cu12==9.5.1.17 (from torch) 2025-05-07T19:48:36.2945462Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cudnn_cu12-9.5.1.17-py3-none-manylinux_2_28_x86_64.whl (571.0 MB) 2025-05-07T19:48:36.2946334Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 571.0/571.0 MB 37.6 MB/s eta 0:00:00 
2025-05-07T19:48:36.2946873Z Collecting nvidia-cublas-cu12==12.6.4.1 (from torch) 2025-05-07T19:48:36.2947685Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cublas_cu12-12.6.4.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (393.1 MB) 2025-05-07T19:48:36.2948583Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 393.1/393.1 MB 53.7 MB/s eta 0:00:00 2025-05-07T19:48:36.2948998Z Collecting nvidia-cufft-cu12==11.3.0.4 (from torch) 2025-05-07T19:48:36.2949905Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufft_cu12-11.3.0.4-py3-none-manylinux2014_x86_64.whl (200.2 MB) 2025-05-07T19:48:36.2950703Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 200.2/200.2 MB 72.9 MB/s eta 0:00:00 2025-05-07T19:48:36.2951134Z Collecting nvidia-curand-cu12==10.3.7.77 (from torch) 2025-05-07T19:48:36.2951841Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_curand_cu12-10.3.7.77-py3-none-manylinux2014_x86_64.whl (56.3 MB) 2025-05-07T19:48:36.2952667Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.3/56.3 MB 65.9 MB/s eta 0:00:00 2025-05-07T19:48:36.2953116Z Collecting nvidia-cusolver-cu12==11.7.1.2 (from torch) 2025-05-07T19:48:36.2953837Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusolver_cu12-11.7.1.2-py3-none-manylinux2014_x86_64.whl (158.2 MB) 2025-05-07T19:48:36.2954668Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 158.2/158.2 MB 73.4 MB/s eta 0:00:00 2025-05-07T19:48:36.2955077Z Collecting nvidia-cusparse-cu12==12.5.4.2 (from torch) 2025-05-07T19:48:36.2955815Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparse_cu12-12.5.4.2-py3-none-manylinux2014_x86_64.whl (216.6 MB) 2025-05-07T19:48:36.2956613Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 216.6/216.6 MB 57.5 MB/s eta 0:00:00 2025-05-07T19:48:36.2957034Z Collecting nvidia-cusparselt-cu12==0.6.3 (from torch) 2025-05-07T19:48:36.2957775Z Downloading 
https://download.pytorch.org/whl/nightly/cu126/nvidia_cusparselt_cu12-0.6.3-py3-none-manylinux2014_x86_64.whl (156.8 MB) 2025-05-07T19:48:36.2958575Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 156.8/156.8 MB 61.6 MB/s eta 0:00:00 2025-05-07T19:48:36.2958977Z Collecting nvidia-nccl-cu12==2.26.2 (from torch) 2025-05-07T19:48:36.2959807Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (2.0 kB) 2025-05-07T19:48:36.2960576Z Collecting nvidia-nvtx-cu12==12.6.77 (from torch) 2025-05-07T19:48:36.2961256Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvtx_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl (89 kB) 2025-05-07T19:48:36.2961926Z Collecting nvidia-nvjitlink-cu12==12.6.85 (from torch) 2025-05-07T19:48:36.2962729Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nvjitlink_cu12-12.6.85-py3-none-manylinux2010_x86_64.manylinux_2_12_x86_64.whl (19.7 MB) 2025-05-07T19:48:36.2963616Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 19.7/19.7 MB 20.8 MB/s eta 0:00:00 2025-05-07T19:48:36.2963997Z Collecting nvidia-cufile-cu12==1.11.1.6 (from torch) 2025-05-07T19:48:36.2964808Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.metadata (1.5 kB) 2025-05-07T19:48:36.2965696Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:48:36.2966559Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:48:36.2967857Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:48:36.2968717Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:48:36.2969286Z Downloading 
https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:48:36.2969923Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 1.8 MB/s eta 0:00:00 2025-05-07T19:48:36.2970689Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:48:36.2971790Z Downloading https://download.pytorch.org/whl/nightly/cu126/torch-2.8.0.dev20250507%2Bcu126-cp39-cp39-manylinux_2_28_x86_64.whl (825.5 MB) 2025-05-07T19:48:36.2972666Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 825.5/825.5 MB 23.9 MB/s eta 0:00:00 2025-05-07T19:48:36.2973457Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_cufile_cu12-1.11.1.6-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (1.1 MB) 2025-05-07T19:48:36.2974305Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 8.8 MB/s eta 0:00:00 2025-05-07T19:48:36.2975080Z Downloading https://download.pytorch.org/whl/nightly/cu126/nvidia_nccl_cu12-2.26.2-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (201.3 MB) 2025-05-07T19:48:36.2976120Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 201.3/201.3 MB 61.8 MB/s eta 0:00:00 2025-05-07T19:48:36.2977141Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.4 MB) 2025-05-07T19:48:36.2978122Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.4/153.4 MB 80.1 MB/s eta 0:00:00 2025-05-07T19:48:36.2979969Z Installing collected packages: nvidia-cusparselt-cu12, mpmath, sympy, pytorch-triton, nvidia-nvtx-cu12, nvidia-nvjitlink-cu12, nvidia-nccl-cu12, nvidia-curand-cu12, nvidia-cufile-cu12, nvidia-cuda-runtime-cu12, nvidia-cuda-nvrtc-cu12, nvidia-cuda-cupti-cu12, nvidia-cublas-cu12, networkx, fsspec, filelock, nvidia-cusparse-cu12, nvidia-cufft-cu12, nvidia-cudnn-cu12, nvidia-cusolver-cu12, torch 2025-05-07T19:48:36.2981712Z 2025-05-07T19:48:36.2983757Z Successfully 
installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.2.1 nvidia-cublas-cu12-12.6.4.1 nvidia-cuda-cupti-cu12-12.6.80 nvidia-cuda-nvrtc-cu12-12.6.77 nvidia-cuda-runtime-cu12-12.6.77 nvidia-cudnn-cu12-9.5.1.17 nvidia-cufft-cu12-11.3.0.4 nvidia-cufile-cu12-1.11.1.6 nvidia-curand-cu12-10.3.7.77 nvidia-cusolver-cu12-11.7.1.2 nvidia-cusparse-cu12-12.5.4.2 nvidia-cusparselt-cu12-0.6.3 nvidia-nccl-cu12-2.26.2 nvidia-nvjitlink-cu12-12.6.85 nvidia-nvtx-cu12-12.6.77 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu126 2025-05-07T19:48:36.2986035Z 2025-05-07T19:48:38.2522935Z torch 2.8.0.dev20250507+cu126 2025-05-07T19:48:38.2528561Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu126) 2025-05-07T19:48:41.3471346Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:48:44.4573185Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu126 2025-05-07T19:48:44.4574478Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:48:47.4763377Z True 2025-05-07T19:48:47.4764040Z True 2025-05-07T19:48:47.4764158Z 2025-05-07T19:48:47.5507545Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:48:47.5595428Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:47.5596102Z if . 
$PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:48:47.5596805Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:47.5597166Z env: 2025-05-07T19:48:47.5597380Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:47.5597694Z BUILD_ENV: build_binary 2025-05-07T19:48:47.5597932Z BUILD_TARGET: default 2025-05-07T19:48:47.5610801Z BUILD_VARIANT: cuda 2025-05-07T19:48:47.5611170Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:47.5611445Z ##[endgroup] 2025-05-07T19:48:47.9963482Z /github/home/miniconda/bin/conda 2025-05-07T19:48:47.9964469Z ################################################################################ 2025-05-07T19:48:47.9965723Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:48:47.9966821Z # 2025-05-07T19:48:47.9981898Z # [2025-05-07T19:48:47.997Z] + collect_pytorch_env_info build_binary 2025-05-07T19:48:47.9983148Z ################################################################################ 2025-05-07T19:48:47.9983850Z 2025-05-07T19:48:48.0003141Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:48.0919217Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:48.0926310Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:48:48.0928222Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:48:48.0928797Z 2025-05-07T19:48:48.1792387Z 2025-05-07T19:48:48.1793558Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:48:48.1822840Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:48:53.6913444Z Collecting environment information... 
2025-05-07T19:48:53.6914529Z PyTorch version: 2.8.0.dev20250507+cu126 2025-05-07T19:48:53.6915469Z Is debug build: False 2025-05-07T19:48:53.6916216Z CUDA used to build PyTorch: 12.6 2025-05-07T19:48:53.6917028Z ROCM used to build PyTorch: N/A 2025-05-07T19:48:53.6917572Z 2025-05-07T19:48:53.6917910Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:48:53.6918839Z GCC version: Could not collect 2025-05-07T19:48:53.6919462Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:53.6920085Z CMake version: version 4.0.2 2025-05-07T19:48:53.6920473Z Libc version: glibc-2.34 2025-05-07T19:48:53.6920630Z 2025-05-07T19:48:53.6920965Z Python version: 3.9.22 | packaged by conda-forge | (main, Apr 14 2025, 23:35:59) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:48:53.6921604Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:48:53.6922050Z Is CUDA available: False 2025-05-07T19:48:53.6922310Z CUDA runtime version: 12.6.85 2025-05-07T19:48:53.6922603Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:48:53.6922942Z GPU models and configuration: Could not collect 2025-05-07T19:48:53.6923286Z Nvidia driver version: Could not collect 2025-05-07T19:48:53.6923613Z cuDNN version: Could not collect 2025-05-07T19:48:53.6924224Z HIP runtime version: N/A 2025-05-07T19:48:53.6924500Z MIOpen runtime version: N/A 2025-05-07T19:48:53.6924764Z Is XNNPACK available: True 2025-05-07T19:48:53.6924946Z 2025-05-07T19:48:53.6925028Z CPU: 2025-05-07T19:48:53.6925361Z Architecture: x86_64 2025-05-07T19:48:53.6925705Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:48:53.6926082Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:48:53.6926549Z Byte Order: Little Endian 2025-05-07T19:48:53.6926888Z CPU(s): 96 2025-05-07T19:48:53.6927180Z On-line CPU(s) list: 0-95 2025-05-07T19:48:53.6927517Z Vendor ID: GenuineIntel 2025-05-07T19:48:53.6928118Z Model name: Intel(R) Xeon(R) 
Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:48:53.6928495Z CPU family: 6 2025-05-07T19:48:53.6928798Z Model: 85 2025-05-07T19:48:53.6929090Z Thread(s) per core: 2 2025-05-07T19:48:53.6929386Z Core(s) per socket: 24 2025-05-07T19:48:53.6929665Z Socket(s): 2 2025-05-07T19:48:53.6929957Z Stepping: 7 2025-05-07T19:48:53.6930247Z BogoMIPS: 6000.01 2025-05-07T19:48:53.6932472Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:48:53.6934707Z Hypervisor vendor: KVM 2025-05-07T19:48:53.6935022Z Virtualization type: full 2025-05-07T19:48:53.6935350Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:48:53.6935725Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:48:53.6936072Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:48:53.6936435Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:48:53.6936766Z NUMA node(s): 2 2025-05-07T19:48:53.6937063Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:48:53.6937397Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:48:53.6937841Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:48:53.6938390Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:48:53.6938856Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:48:53.6939449Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:53.6940012Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:48:53.6940915Z 
Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:48:53.6941556Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:48:53.6941936Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:48:53.6942333Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:48:53.6942715Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:48:53.6943300Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:48:53.6944158Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:48:53.6944907Z Vulnerability Srbds: Not affected 2025-05-07T19:48:53.6945308Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:48:53.6945556Z 2025-05-07T19:48:53.6945665Z Versions of relevant libraries: 2025-05-07T19:48:53.6945962Z [pip3] numpy==2.0.2 2025-05-07T19:48:53.6946219Z [pip3] nvidia-cublas-cu12==12.6.4.1 2025-05-07T19:48:53.6946558Z [pip3] nvidia-cuda-cupti-cu12==12.6.80 2025-05-07T19:48:53.6946885Z [pip3] nvidia-cuda-nvrtc-cu12==12.6.77 2025-05-07T19:48:53.6947234Z [pip3] nvidia-cuda-runtime-cu12==12.6.77 2025-05-07T19:48:53.6947579Z [pip3] nvidia-cudnn-cu12==9.5.1.17 2025-05-07T19:48:53.6947875Z [pip3] nvidia-cufft-cu12==11.3.0.4 2025-05-07T19:48:53.6948189Z [pip3] nvidia-curand-cu12==10.3.7.77 2025-05-07T19:48:53.6948495Z [pip3] nvidia-cusolver-cu12==11.7.1.2 2025-05-07T19:48:53.6948934Z [pip3] nvidia-cusparse-cu12==12.5.4.2 2025-05-07T19:48:53.6949253Z [pip3] nvidia-cusparselt-cu12==0.6.3 2025-05-07T19:48:53.6949585Z [pip3] nvidia-nccl-cu12==2.26.2 2025-05-07T19:48:53.6949882Z [pip3] nvidia-nvjitlink-cu12==12.6.85 2025-05-07T19:48:53.6950209Z [pip3] nvidia-nvtx-cu12==12.6.77 2025-05-07T19:48:53.6950536Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:48:53.6950870Z [pip3] torch==2.8.0.dev20250507+cu126 2025-05-07T19:48:53.6951260Z [conda] cuda-cudart 12.6.77 
h5888daf_0 conda-forge 2025-05-07T19:48:53.6951798Z [conda] cuda-cudart-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.6952341Z [conda] cuda-cudart-dev_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:53.6953032Z [conda] cuda-cudart-static 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.6953585Z [conda] cuda-cudart-static_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:53.6954121Z [conda] cuda-cudart_linux-64 12.6.77 h3f2d84a_0 conda-forge 2025-05-07T19:48:53.6954616Z [conda] cuda-cupti 12.6.80 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6955085Z [conda] cuda-cupti-dev 12.6.80 h5888daf_0 conda-forge 2025-05-07T19:48:53.6955589Z [conda] cuda-libraries 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:53.6956104Z [conda] cuda-libraries-dev 12.6.3 ha770c72_0 conda-forge 2025-05-07T19:48:53.6956586Z [conda] cuda-nvrtc 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6957066Z [conda] cuda-nvrtc-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:53.6957526Z [conda] cuda-nvtx 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6957997Z [conda] cuda-opencl 12.6.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6958475Z [conda] cuda-opencl-dev 12.6.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.6958970Z [conda] cuda-runtime 12.6.3 ha804496_0 conda-forge 2025-05-07T19:48:53.6959453Z [conda] libcublas 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:53.6959914Z [conda] libcublas-dev 12.6.4.1 h5888daf_1 conda-forge 2025-05-07T19:48:53.6960387Z [conda] libcufft 11.3.0.4 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6960843Z [conda] libcufft-dev 11.3.0.4 h5888daf_0 conda-forge 2025-05-07T19:48:53.6961324Z [conda] libcurand 10.3.7.77 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6961785Z [conda] libcurand-dev 10.3.7.77 h5888daf_0 conda-forge 2025-05-07T19:48:53.6962266Z [conda] libcusolver 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:53.6962754Z [conda] libcusolver-dev 11.7.1.2 h5888daf_1 conda-forge 2025-05-07T19:48:53.6963226Z [conda] libcusparse 12.5.4.2 
hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6963718Z [conda] libcusparse-dev 12.5.4.2 h5888daf_0 conda-forge 2025-05-07T19:48:53.6964264Z [conda] libnvjitlink 12.6.85 hbd13f7d_0 conda-forge 2025-05-07T19:48:53.6964756Z [conda] libnvjitlink-dev 12.6.85 h5888daf_0 conda-forge 2025-05-07T19:48:53.6965228Z [conda] numpy 2.0.2 py39h9cb892a_1 conda-forge 2025-05-07T19:48:53.6965676Z [conda] nvidia-cublas-cu12 12.6.4.1 pypi_0 pypi 2025-05-07T19:48:53.6966180Z [conda] nvidia-cuda-cupti-cu12 12.6.80 pypi_0 pypi 2025-05-07T19:48:53.6966665Z [conda] nvidia-cuda-nvrtc-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:53.6967175Z [conda] nvidia-cuda-runtime-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:53.6967722Z [conda] nvidia-cudnn-cu12 9.5.1.17 pypi_0 pypi 2025-05-07T19:48:53.6968205Z [conda] nvidia-cufft-cu12 11.3.0.4 pypi_0 pypi 2025-05-07T19:48:53.6968696Z [conda] nvidia-curand-cu12 10.3.7.77 pypi_0 pypi 2025-05-07T19:48:53.6969185Z [conda] nvidia-cusolver-cu12 11.7.1.2 pypi_0 pypi 2025-05-07T19:48:53.6969690Z [conda] nvidia-cusparse-cu12 12.5.4.2 pypi_0 pypi 2025-05-07T19:48:53.6970182Z [conda] nvidia-cusparselt-cu12 0.6.3 pypi_0 pypi 2025-05-07T19:48:53.6970684Z [conda] nvidia-nccl-cu12 2.26.2 pypi_0 pypi 2025-05-07T19:48:53.6971157Z [conda] nvidia-nvjitlink-cu12 12.6.85 pypi_0 pypi 2025-05-07T19:48:53.6971649Z [conda] nvidia-nvtx-cu12 12.6.77 pypi_0 pypi 2025-05-07T19:48:53.6972130Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:48:53.6972590Z [conda] torch 2.8.0.dev20250507+cu126 pypi_0 pypi 2025-05-07T19:48:53.6972878Z 2025-05-07T19:48:53.7909658Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:53.7910678Z . 
$PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 12.6.3 2025-05-07T19:48:53.7911414Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:53.7911784Z env: 2025-05-07T19:48:53.7912036Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:53.7912438Z BUILD_ENV: build_binary 2025-05-07T19:48:53.7913022Z BUILD_TARGET: default 2025-05-07T19:48:53.7913266Z BUILD_VARIANT: cuda 2025-05-07T19:48:53.7913571Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:48:53.7913843Z ##[endgroup] 2025-05-07T19:48:54.2323678Z ################################################################################ 2025-05-07T19:48:54.2324692Z # Install cuDNN 2025-05-07T19:48:54.2325330Z # 2025-05-07T19:48:54.2338280Z # [2025-05-07T19:48:54.233Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 12.6.3 2025-05-07T19:48:54.2338958Z ################################################################################ 2025-05-07T19:48:54.2339232Z 2025-05-07T19:48:54.2350213Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:54.3247599Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:54.3248751Z [INSTALL] cuda_concat_version is determined to be: 126 2025-05-07T19:48:54.3249231Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:54.3249459Z 2025-05-07T19:48:54.3262234Z 2025-05-07T19:48:54.3263264Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:48:54.3263602Z 2025-05-07T19:48:54.3277222Z 2025-05-07T19:48:54.3302893Z [INSTALL] Downloading cuDNN to /tmp/tmp.2ItTwf0Fg4 ... 2025-05-07T19:48:54.3329773Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/cudnn/redist/cudnn/linux-x86_64/cudnn-linux-x86_64-9.5.1.17_cuda12-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:48:56.1588920Z [INSTALL] Unpacking cuDNN ... 
2025-05-07T19:48:56.1589386Z + tar -xvf cudnn.tar.xz 2025-05-07T19:48:56.1589561Z 2025-05-07T19:48:56.1620861Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/ 2025-05-07T19:48:56.1622642Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/ 2025-05-07T19:48:56.1623109Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static_v9.a 2025-05-07T19:49:00.8522846Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static_v9.a 2025-05-07T19:49:00.9159439Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static_v9.a 2025-05-07T19:49:08.5405509Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static_v9.a 2025-05-07T19:49:08.7895992Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static_v9.a 2025-05-07T19:49:08.8278710Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static_v9.a 2025-05-07T19:49:09.3779667Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static_v9.a 2025-05-07T19:49:11.5286995Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv_static.a 2025-05-07T19:49:11.5287612Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn_static.a 2025-05-07T19:49:11.5288236Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled_static.a 2025-05-07T19:49:11.5288895Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled_static.a 2025-05-07T19:49:11.5289521Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph_static.a 2025-05-07T19:49:11.5290052Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic_static.a 2025-05-07T19:49:11.5290609Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops_static.a 2025-05-07T19:49:11.5291086Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so 2025-05-07T19:49:11.5291564Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9 2025-05-07T19:49:11.5292045Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn.so.9.5.1 
2025-05-07T19:49:11.5296449Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so 2025-05-07T19:49:11.5297321Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9 2025-05-07T19:49:11.5297836Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_adv.so.9.5.1 2025-05-07T19:49:16.1481227Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so 2025-05-07T19:49:16.1482767Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9.5.1 2025-05-07T19:49:16.2109547Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_cnn.so.9 2025-05-07T19:49:16.2111353Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9.5.1 2025-05-07T19:49:23.4183289Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so.9 2025-05-07T19:49:23.4183958Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_precompiled.so 2025-05-07T19:49:23.4184579Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so 2025-05-07T19:49:23.4185278Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9.5.1 2025-05-07T19:49:23.6098430Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_engines_runtime_compiled.so.9 2025-05-07T19:49:23.6100438Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9 2025-05-07T19:49:23.6101924Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so 2025-05-07T19:49:23.6103451Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_graph.so.9.5.1 2025-05-07T19:49:23.6448643Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9.5.1 2025-05-07T19:49:24.1741820Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so.9 2025-05-07T19:49:24.1742616Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_heuristic.so 2025-05-07T19:49:24.1743148Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9 2025-05-07T19:49:24.1743686Z 
cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so 2025-05-07T19:49:24.1744204Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib/libcudnn_ops.so.9.5.1 2025-05-07T19:49:26.2954197Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/ 2025-05-07T19:49:26.2955578Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_v9.h 2025-05-07T19:49:26.2957451Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv_v9.h 2025-05-07T19:49:26.2958952Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend_v9.h 2025-05-07T19:49:26.2960414Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn_v9.h 2025-05-07T19:49:26.2961564Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph_v9.h 2025-05-07T19:49:26.2962113Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops_v9.h 2025-05-07T19:49:26.2962615Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version_v9.h 2025-05-07T19:49:26.2963114Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn.h 2025-05-07T19:49:26.2963567Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_adv.h 2025-05-07T19:49:26.2964071Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_backend.h 2025-05-07T19:49:26.2964584Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_cnn.h 2025-05-07T19:49:26.2965076Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_graph.h 2025-05-07T19:49:26.2965575Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_ops.h 2025-05-07T19:49:26.2966056Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include/cudnn_version.h 2025-05-07T19:49:26.2966515Z cudnn-linux-x86_64-9.5.1.17_cuda12-archive/LICENSE 2025-05-07T19:49:26.2975392Z 2025-05-07T19:49:26.2976539Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 
2025-05-07T19:49:26.2977909Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:49:26.2978617Z 2025-05-07T19:49:26.2994424Z 2025-05-07T19:49:26.2994896Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:26.2995162Z 2025-05-07T19:49:26.3015468Z 2025-05-07T19:49:26.3016984Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:26.3018153Z 2025-05-07T19:49:26.3049909Z 2025-05-07T19:49:26.3050893Z + mv cudnn-linux-x86_64-9.5.1.17_cuda12-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:49:26.3051414Z 2025-05-07T19:49:27.7764932Z 2025-05-07T19:49:27.7765285Z /__w/FBGEMM/FBGEMM 2025-05-07T19:49:27.7766944Z + rm -rf /tmp/tmp.2ItTwf0Fg4 2025-05-07T19:49:27.7767606Z 2025-05-07T19:49:28.2103222Z 2025-05-07T19:49:28.2116405Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:49:28.2117442Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:49:28.2118129Z 2025-05-07T19:49:28.6301612Z 2025-05-07T19:49:28.6302013Z [INSTALL] Successfully installed cuDNN (for CUDA 12.6.3) 2025-05-07T19:49:28.6370399Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:28.6371043Z . 
$PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:49:28.6371680Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:28.6372050Z env: 2025-05-07T19:49:28.6372314Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:28.6372659Z BUILD_ENV: build_binary 2025-05-07T19:49:28.6372923Z BUILD_TARGET: default 2025-05-07T19:49:28.6373201Z BUILD_VARIANT: cuda 2025-05-07T19:49:28.6373482Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:28.6373753Z ##[endgroup] 2025-05-07T19:49:29.0861520Z ################################################################################ 2025-05-07T19:49:29.0862621Z # Prepare FBGEMM-GPU Build 2025-05-07T19:49:29.0863358Z # 2025-05-07T19:49:29.0878976Z # [2025-05-07T19:49:29.087Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:49:29.0880317Z ################################################################################ 2025-05-07T19:49:29.0880993Z 2025-05-07T19:49:29.0902862Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:29.1759944Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:29.1775827Z [BUILD] Running git submodules update ... 
2025-05-07T19:49:29.1798152Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:49:29.2098685Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:49:29.2100402Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:49:29.2101797Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:49:29.2103059Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:49:29.2103542Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:49:29.2104005Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:49:29.2104467Z Synchronizing submodule url for '../external/json' 2025-05-07T19:49:29.2126301Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:49:29.2552812Z [BUILD] Installing other build dependencies ... 2025-05-07T19:49:29.2575348Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:49:31.1261796Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:49:31.1410630Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:49:31.1509023Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:49:31.2868049Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:49:31.2904296Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:49:31.2978757Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:49:31.2980219Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:49:31.2981555Z Requirement already 
satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:49:31.2984770Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:49:31.3311266Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:49:31.3350013Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:49:31.3414674Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 21)) (2.0.2) 2025-05-07T19:49:31.3568940Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:49:31.3620780Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:49:31.3696162Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:49:31.3700307Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:49:31.3702398Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:49:31.3934844Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:49:31.3972163Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:49:31.4167759Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:49:31.4210638Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:49:31.4462013Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:49:31.4513284Z Downloading 
patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:49:31.4669873Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:49:31.4672465Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:49:31.4681804Z Requirement already satisfied: importlib-metadata>=4.6 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (8.7.0) 2025-05-07T19:49:31.4688711Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from build->-r requirements.txt (line 14)) (2.2.1) 2025-05-07T19:49:31.4812991Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:49:31.4817372Z Requirement already satisfied: exceptiongroup>=1.0.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from hypothesis->-r requirements.txt (line 17)) (1.2.2) 2025-05-07T19:49:31.4822780Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:49:31.4842506Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:49:31.4974474Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:31.5011101Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:49:31.5079561Z Requirement 
already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:49:31.5128270Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:49:31.5132930Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:49:31.5349357Z Requirement already satisfied: zipp>=3.20 in /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages (from importlib-metadata>=4.6->build->-r requirements.txt (line 14)) (3.21.0) 2025-05-07T19:49:31.5543947Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:49:31.5579027Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:49:31.5689409Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:49:31.5790761Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:49:31.6851234Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 270.6 MB/s eta 0:00:00 2025-05-07T19:49:31.6892622Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:49:31.6979224Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:49:31.7051038Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:49:31.7122149Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:49:31.7190341Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:49:31.7270968Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:49:31.7349150Z 
Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:49:31.9170461Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:49:32.8711364Z 2025-05-07T19:49:32.8781162Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:32.8783393Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:49:33.0066698Z ################################################################################ 2025-05-07T19:49:33.0067791Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:49:33.0068540Z # 2025-05-07T19:49:33.0083081Z # [2025-05-07T19:49:33.007Z] + install_triton_pip build_binary 2025-05-07T19:49:33.0084308Z ################################################################################ 2025-05-07T19:49:33.0085018Z 2025-05-07T19:49:33.0085691Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 
2025-05-07T19:49:33.0087023Z ################################################################################ 2025-05-07T19:49:33.0088100Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:49:33.0089061Z # 2025-05-07T19:49:33.0102797Z # [2025-05-07T19:49:33.009Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:33.0103393Z ################################################################################ 2025-05-07T19:49:33.0103663Z 2025-05-07T19:49:33.0120095Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:49:33.1006907Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:49:33.1007361Z ################################################################################ 2025-05-07T19:49:33.1007771Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:49:33.1022992Z # 2025-05-07T19:49:33.1023475Z # [2025-05-07T19:49:33.101Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:49:33.1024035Z ################################################################################ 2025-05-07T19:49:33.1024273Z 2025-05-07T19:49:33.1069088Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:49:33.1082528Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:49:33.1084119Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:33.1091225Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:33.1095467Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 
2025-05-07T19:49:33.1121867Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:38.2646999Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:49:38.2651250Z torch 2.8.0.dev20250507+cu126 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:49:38.2653659Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:49:38.2654113Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:49:38.2654961Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:49:38.2656229Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp39-cp39-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.4 MB) 2025-05-07T19:49:38.2657826Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.4/166.4 MB 192.2 MB/s eta 0:00:00 2025-05-07T19:49:38.2658222Z Installing collected packages: pytorch-triton 2025-05-07T19:49:38.2658597Z Attempting uninstall: pytorch-triton 2025-05-07T19:49:38.2658989Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:49:38.2659448Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:49:38.2659867Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:49:38.2660445Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:49:38.2660907Z 2025-05-07T19:49:38.2662328Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. 
It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:49:38.2663801Z 2025-05-07T19:49:40.1636040Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:49:40.1636572Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:49:41.9826217Z ################################################################################ 2025-05-07T19:49:41.9827491Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:49:41.9828657Z ################################################################################ 2025-05-07T19:49:41.9829326Z 2025-05-07T19:49:43.7219229Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:49:45.5944785Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:49:45.5945272Z [BUILD] Successfully ran git submodules update 2025-05-07T19:49:45.6013192Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:45.6013884Z . 
$PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:49:45.6014473Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:49:45.6014792Z env: 2025-05-07T19:49:45.6015001Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:49:45.6015303Z BUILD_ENV: build_binary 2025-05-07T19:49:45.6015529Z BUILD_TARGET: default 2025-05-07T19:49:45.6015758Z BUILD_VARIANT: cuda 2025-05-07T19:49:45.6015995Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T19:49:45.6016226Z ##[endgroup] 2025-05-07T19:49:46.0978397Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:49:46.0979324Z [BUILD] Extracted build target: default 2025-05-07T19:49:46.0979754Z [BUILD] Extracted build variant: cuda 2025-05-07T19:49:47.6807412Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:49:47.6808201Z 2025-05-07T19:49:47.7578209Z [CHECK] Binary cc found in PATH 2025-05-07T19:49:49.3478306Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:49:49.3478645Z 2025-05-07T19:49:49.4225045Z [CHECK] Binary gcc found in PATH 2025-05-07T19:49:51.0201732Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:49:51.0202168Z 2025-05-07T19:49:51.0801526Z [CHECK] Binary c++ found in PATH 2025-05-07T19:49:52.6599163Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:49:52.6599953Z 2025-05-07T19:49:52.7175595Z [CHECK] Binary g++ found in PATH 2025-05-07T19:49:54.3663993Z [BUILD] Extracted and set Python tag: py39 2025-05-07T19:49:54.3664749Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:49:54.3885152Z core = 24 2025-05-07T19:49:54.4100469Z sockets = 2 2025-05-07T19:49:54.4101380Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:49:54.4102475Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:49:54.4103290Z [BUILD] Running pre-build cleanups ... 
2025-05-07T19:49:54.4104163Z + rm -rf dist 2025-05-07T19:49:54.4104552Z 2025-05-07T19:49:54.4115985Z 2025-05-07T19:49:54.4116701Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:49:54.4117180Z 2025-05-07T19:49:57.3972433Z INFO:root:running clean 2025-05-07T19:49:57.3974282Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:49:57.3978268Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:49:57.3980455Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:49:57.3980946Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:49:57.3981543Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:49:57.3982153Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:49:57.3982745Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:49:57.3983176Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:49:57.3984425Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:49:57.7098936Z 2025-05-07T19:49:57.7099574Z [BUILD] Printing git status ... 2025-05-07T19:49:57.7100636Z + git status 2025-05-07T19:49:57.7100989Z 2025-05-07T19:49:58.1572359Z HEAD detached at pull/4066/merge 2025-05-07T19:49:58.1573265Z Untracked files: 2025-05-07T19:49:58.1574125Z (use "git add <file>..."
to include in what will be committed) 2025-05-07T19:49:58.1575213Z ../build_only/ 2025-05-07T19:49:58.1575859Z ../collect_env.py 2025-05-07T19:49:58.1576350Z fbgemm_gpu/docs/version.py 2025-05-07T19:49:58.1576526Z 2025-05-07T19:49:58.1577148Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:49:58.1577501Z 2025-05-07T19:49:58.1577587Z + git diff 2025-05-07T19:49:58.1577706Z 2025-05-07T19:49:58.1860495Z 2025-05-07T19:49:58.1861111Z ################################################################################ 2025-05-07T19:49:58.1862170Z # Configure FBGEMM-GPU Build 2025-05-07T19:49:58.1862771Z # 2025-05-07T19:49:58.1878362Z # [2025-05-07T19:49:58.187Z] + __configure_fbgemm_gpu_build 2025-05-07T19:49:58.1878791Z ################################################################################ 2025-05-07T19:49:58.1879028Z 2025-05-07T19:49:58.1882729Z [BUILD] Setting the build target: default ... 2025-05-07T19:49:59.8023474Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 
2025-05-07T19:49:59.8024567Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:49:59.8024835Z 2025-05-07T19:49:59.8607109Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:50:01.4769956Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:50:01.4770339Z 2025-05-07T19:50:01.5581486Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:50:03.1822108Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:50:03.1822396Z 2025-05-07T19:50:03.2598992Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:50:04.8865079Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:04.8865469Z 2025-05-07T19:50:04.9651919Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:50:06.6725524Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:50:06.6726116Z Copyright (c) 2005-2024 NVIDIA Corporation 2025-05-07T19:50:06.6726454Z Built on Tue_Oct_29_23:50:19_PDT_2024 2025-05-07T19:50:06.6726800Z Cuda compilation tools, release 12.6, V12.6.85 2025-05-07T19:50:06.6727189Z Build cuda_12.6.r12.6/compiler.35059454_0 ... 2025-05-07T19:50:06.6727577Z [BUILD] Setting the following CUDA targets: 7.0;8.0;9.0;9.0a 2025-05-07T19:50:06.6727968Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:50:08.3042373Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:50:11.7137370Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:50:11.7138089Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:50:11.7138415Z 2025-05-07T19:50:12.1396012Z 2025-05-07T19:50:12.1396870Z [BUILD] Setting CUDA build args ... 2025-05-07T19:50:13.7747853Z [BUILD] Looking up CUDA version ... 
2025-05-07T19:50:17.0303580Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:17.0304451Z 2025-05-07T19:50:18.6983389Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:18.6984488Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:18.6984974Z 2025-05-07T19:50:18.6985139Z [BUILD] Setting NVCC flags ... 2025-05-07T19:50:18.6986129Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:50:18.6987013Z 2025-05-07T19:50:19.1137090Z 2025-05-07T19:50:19.1137420Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:50:19.1137700Z 2025-05-07T19:50:20.6827450Z -std=c++20 -Xcompiler -std=c++20 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:50:20.6829363Z 2025-05-07T19:50:20.7410056Z 2025-05-07T19:50:20.7410834Z [BUILD] Setting CUDA build args ... 
2025-05-07T19:50:20.7411832Z + conda run -n build_binary c++ --version 2025-05-07T19:50:20.7412508Z 2025-05-07T19:50:22.3326958Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:22.3328279Z Target: x86_64-conda-linux-gnu 2025-05-07T19:50:22.3328580Z Thread model: posix 2025-05-07T19:50:22.3328946Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:50:22.3329618Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:22.3330092Z 2025-05-07T19:50:22.3889010Z 2025-05-07T19:50:22.3889421Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:50:22.3889745Z 2025-05-07T19:50:24.0377842Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:24.0378799Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:50:24.0379277Z 2025-05-07T19:50:24.0379484Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:50:25.6582964Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:50:25.6584575Z [BUILD] Enabling debug features in the build ... 
2025-05-07T19:50:25.6589644Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:50:25.6592095Z ################################################################################ 2025-05-07T19:50:25.6592465Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:50:25.6592751Z # 2025-05-07T19:50:25.6693554Z # [2025-05-07T19:50:25.659Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:50:25.6696144Z ################################################################################ 2025-05-07T19:50:25.6696987Z 2025-05-07T19:50:25.6697558Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 
2025-05-07T19:50:25.6706131Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0;9.0;9.0a' --config-setting=--build-option=-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux --config-setting=--build-option=-DCMAKE_CXX_STANDARD=20 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py39 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:50:25.6713685Z 2025-05-07T19:50:27.2752995Z * Getting build dependencies for wheel... 
2025-05-07T19:50:28.7132161Z INFO:root:running egg_info 2025-05-07T19:50:28.7163658Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:50:28.7164162Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:50:28.7166458Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:50:28.7168552Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:50:28.7169471Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:50:28.7171153Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:28.7236422Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:28.7253688Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:50:28.7255932Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:50:28.7258528Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:50:28.7259736Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:50:28.7260221Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:28.7261123Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:28.7261792Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:28.7262351Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:28.7262781Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 
2025-05-07T19:50:28.7264032Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:50:29.0698034Z * Building wheel... 2025-05-07T19:50:30.5078968Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-sth8qixf', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py39', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:30.5084180Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:50:30.5087182Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-sth8qixf', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20', 
'--python-tag=py39', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:50:30.5088890Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:50:30.5089422Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:50:30.5089979Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:50:30.5090498Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:50:30.5090892Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:50:30.5096805Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a', '-DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', 
'-DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux', '-DCMAKE_CXX_STANDARD=20'] 2025-05-07T19:50:30.5103000Z 2025-05-07T19:50:30.5103005Z 2025-05-07T19:50:30.5103191Z -------------------------------------------------------------------------------- 2025-05-07T19:50:30.5103602Z -- Trying 'Ninja' generator 2025-05-07T19:50:30.5103871Z -------------------------------- 2025-05-07T19:50:30.5104161Z --------------------------- 2025-05-07T19:50:30.5104412Z ---------------------- 2025-05-07T19:50:30.5104667Z ----------------- 2025-05-07T19:50:30.5104883Z ------------ 2025-05-07T19:50:30.5105109Z ------- 2025-05-07T19:50:30.5105304Z -- 2025-05-07T19:50:30.5540201Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:50:30.5542100Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:50:30.5543310Z CMake. 2025-05-07T19:50:30.5543627Z 2025-05-07T19:50:30.5544322Z Update the VERSION argument <min> value. Or, use the <min>...<max> syntax 2025-05-07T19:50:30.5545914Z to tell CMake that the project requires at least <min> but has been updated 2025-05-07T19:50:30.5547299Z to work with policies introduced by <max> or earlier. 2025-05-07T19:50:30.5548451Z 2025-05-07T19:50:30.5548465Z 2025-05-07T19:50:30.5548996Z Not searching for unused variables given on the command line.
2025-05-07T19:50:30.6351222Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:30.6461083Z -- Detecting C compiler ABI info 2025-05-07T19:50:30.7658973Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:30.7778942Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:30.7780695Z -- Detecting C compile features 2025-05-07T19:50:30.7782717Z -- Detecting C compile features - done 2025-05-07T19:50:30.9150811Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:30.9233689Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:31.0680207Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:31.0806332Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:31.0807921Z -- Detecting CXX compile features 2025-05-07T19:50:31.0814031Z -- Detecting CXX compile features - done 2025-05-07T19:50:31.0828540Z -- Configuring done (0.6s) 2025-05-07T19:50:31.0878366Z -- Generating done (0.0s) 2025-05-07T19:50:31.0894058Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:50:31.0938138Z -- 2025-05-07T19:50:31.0939414Z ------- 2025-05-07T19:50:31.0939826Z ------------ 2025-05-07T19:50:31.0940080Z ----------------- 2025-05-07T19:50:31.0940495Z ---------------------- 2025-05-07T19:50:31.0940824Z --------------------------- 2025-05-07T19:50:31.0941162Z -------------------------------- 2025-05-07T19:50:31.0941517Z -- Trying 'Ninja' generator - success 2025-05-07T19:50:31.0942263Z -------------------------------------------------------------------------------- 2025-05-07T19:50:31.0942586Z 2025-05-07T19:50:31.0950531Z Configuring Project 2025-05-07T19:50:31.0950848Z Working directory: 2025-05-07T19:50:31.0951330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build 2025-05-07T19:50:31.0951799Z Command: 2025-05-07T19:50:31.0969376Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install -DPYTHON_VERSION_STRING:STRING=3.9.22 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.9 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.9.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.9 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.9 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 
-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc -DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0;9.0;9.0a' -DCUDA_TOOLKIT_ROOT_DIR=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCUDAToolkit_ROOT=/github/home/miniconda/envs/build_binary/targets/x86_64-linux -DCMAKE_CXX_STANDARD=20 -DCMAKE_BUILD_TYPE:STRING=Release 2025-05-07T19:50:31.0983941Z 2025-05-07T19:50:31.1484927Z 2025-05-07T19:50:31.1484946Z 2025-05-07T19:50:31.1485496Z ================================================================================ 2025-05-07T19:50:31.1486843Z Not searching for unused variables given on the command line. 
2025-05-07T19:50:31.1488076Z Default C compiler flags 2025-05-07T19:50:31.1489111Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:31.1490014Z 2025-05-07T19:50:31.1492963Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:31.1495027Z ================================================================================ 2025-05-07T19:50:31.1495251Z 2025-05-07T19:50:31.1495255Z 2025-05-07T19:50:31.1495258Z 2025-05-07T19:50:31.1495369Z ================================================================================ 2025-05-07T19:50:31.1495710Z Default C++ compiler flags 2025-05-07T19:50:31.1496050Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:50:31.1496354Z 2025-05-07T19:50:31.1497121Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:50:31.1498112Z ================================================================================ 2025-05-07T19:50:31.1498330Z 2025-05-07T19:50:31.1498333Z 2025-05-07T19:50:31.1498336Z 2025-05-07T19:50:31.1498448Z ================================================================================ 2025-05-07T19:50:31.1498768Z AVX2_FLAGS: 2025-05-07T19:50:31.1498884Z 2025-05-07T19:50:31.1498981Z -mavx2 2025-05-07T19:50:31.1499161Z -mf16c 2025-05-07T19:50:31.1499360Z -mfma 2025-05-07T19:50:31.1499542Z -fopenmp 2025-05-07T19:50:31.1499776Z ================================================================================ 2025-05-07T19:50:31.1499991Z 2025-05-07T19:50:31.1499995Z 2025-05-07T19:50:31.1499998Z 2025-05-07T19:50:31.1500108Z ================================================================================ 
2025-05-07T19:50:31.1500538Z AVX512_FLAGS: 2025-05-07T19:50:31.1500661Z 2025-05-07T19:50:31.1500938Z -mavx2 2025-05-07T19:50:31.1501139Z -mf16c 2025-05-07T19:50:31.1501354Z -mfma 2025-05-07T19:50:31.1501553Z -mavx512f 2025-05-07T19:50:31.1501786Z -mavx512bw 2025-05-07T19:50:31.1501992Z -mavx512dq 2025-05-07T19:50:31.1502209Z -mavx512vl 2025-05-07T19:50:31.1502412Z -fopenmp 2025-05-07T19:50:31.1502664Z ================================================================================ 2025-05-07T19:50:31.1503025Z 2025-05-07T19:50:31.1503029Z 2025-05-07T19:50:31.1503033Z 2025-05-07T19:50:31.1503175Z ================================================================================ 2025-05-07T19:50:31.1503530Z The project is built using scikit-build 2025-05-07T19:50:31.1503883Z ================================================================================ 2025-05-07T19:50:31.1504119Z 2025-05-07T19:50:31.1504123Z 2025-05-07T19:50:31.1504127Z 2025-05-07T19:50:31.1504244Z ================================================================================ 2025-05-07T19:50:31.1504587Z Build Settings 2025-05-07T19:50:31.1504721Z 2025-05-07T19:50:31.1504829Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:50:31.1505150Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:50:31.1505338Z 2025-05-07T19:50:31.1505459Z NVCC_VERBOSE : 2025-05-07T19:50:31.1505719Z CUDNN_INCLUDE_DIR : 2025-05-07T19:50:31.1506001Z CUDNN_LIBRARY : 2025-05-07T19:50:31.1506429Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:31.1507064Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:50:31.1507305Z 8.0 2025-05-07T19:50:31.1507507Z 9.0 2025-05-07T19:50:31.1507689Z 9.0a 2025-05-07T19:50:31.1507813Z 2025-05-07T19:50:31.1507905Z HIP_ROOT_DIR : 2025-05-07T19:50:31.1508166Z HIPCC_VERBOSE : 2025-05-07T19:50:31.1508404Z AMDGPU_TARGETS : 2025-05-07T19:50:31.1508660Z PYTORCH_ROCM_ARCH : 2025-05-07T19:50:31.1508917Z 
================================================================================ 2025-05-07T19:50:31.1509133Z 2025-05-07T19:50:31.2907944Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:50:31.3604060Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:50:32.4149565Z -- The CUDA compiler identification is NVIDIA 12.6.85 with host compiler Clang 16.0.6 2025-05-07T19:50:32.4258336Z -- Detecting CXX compiler ABI info 2025-05-07T19:50:32.5700912Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:50:32.5828567Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:50:32.5829486Z -- Detecting CXX compile features 2025-05-07T19:50:32.5838506Z -- Detecting CXX compile features - done 2025-05-07T19:50:32.5913328Z -- Detecting C compiler ABI info 2025-05-07T19:50:32.7102029Z -- Detecting C compiler ABI info - done 2025-05-07T19:50:32.7225100Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:50:32.7227391Z -- Detecting C compile features 2025-05-07T19:50:32.7230903Z -- Detecting C compile features - done 2025-05-07T19:50:32.7279499Z -- Detecting CUDA compiler ABI info 2025-05-07T19:50:33.7356088Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:50:33.7887579Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:50:33.7918923Z -- Detecting CUDA compile features 2025-05-07T19:50:33.7920556Z -- Detecting CUDA compile features - done 2025-05-07T19:50:33.7943248Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:50:34.0764483Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:50:34.0765492Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:50:34.4006367Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:50:34.4007255Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:50:34.6842060Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:50:34.6843512Z -- Performing Test 
C_HAS_AVX2_2 2025-05-07T19:50:35.0104780Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:50:35.0106001Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:50:35.2947715Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:50:35.2948194Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:50:35.6214896Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:50:35.6215976Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:50:35.9039985Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:50:35.9040408Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:50:36.2304753Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:50:36.2305807Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:50:36.5146427Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:50:36.5147460Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:50:36.8403866Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:50:36.8404889Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:50:37.1267139Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:50:37.1268163Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:50:37.4571321Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:50:37.4743441Z -- Found CUDA: /github/home/miniconda/envs/build_binary/targets/x86_64-linux (found version "12.6") 2025-05-07T19:50:37.4778029Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include (found version "12.6.85") 2025-05-07T19:50:37.4843081Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:50:37.6058640Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:50:37.6068978Z -- Found Threads: TRUE 2025-05-07T19:50:37.6080725Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Caffe2/FindCUDAToolkit.cmake:957 (message): 2025-05-07T19:50:37.6081693Z Could not find librt library, needed by CUDA::cudart_static 2025-05-07T19:50:37.6082105Z Call Stack (most recent call first): 
2025-05-07T19:50:37.6082823Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:59 (find_package) 2025-05-07T19:50:37.6083952Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:37.6085514Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:37.6086337Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:37.6086775Z CMakeLists.txt:112 (include) 2025-05-07T19:50:37.6086955Z 2025-05-07T19:50:37.6086960Z 2025-05-07T19:50:37.7352633Z -- PyTorch: CUDA detected: 12.6 2025-05-07T19:50:37.7353215Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/targets/x86_64-linux/bin/nvcc 2025-05-07T19:50:37.7353986Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary/targets/x86_64-linux 2025-05-07T19:50:37.9044878Z -- PyTorch: Header version is: 12.6 2025-05-07T19:50:38.0869112Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.9.22") found components: Interpreter 2025-05-07T19:50:38.0882012Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:50:38.0882851Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:50:38.0883265Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:50:38.0883727Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:50:38.0884211Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:50:38.0884626Z -- USE_CUFILE is set to 0. 
Compiling without cuFile support 2025-05-07T19:50:38.0885036Z Call Stack (most recent call first): 2025-05-07T19:50:38.0885744Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:50:38.0886840Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:50:38.0887797Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:38.0888228Z CMakeLists.txt:112 (include) 2025-05-07T19:50:38.0888423Z 2025-05-07T19:50:38.0888428Z 2025-05-07T19:50:38.0888981Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80;-gencode;arch=compute_90,code=sm_90;-gencode;arch=compute_90a,code=sm_90a 2025-05-07T19:50:38.1217418Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:50:38.1218822Z static library kineto_LIBRARY-NOTFOUND not found. 
2025-05-07T19:50:38.1219193Z Call Stack (most recent call first): 2025-05-07T19:50:38.1219952Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:50:38.1221166Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:50:38.1221651Z CMakeLists.txt:112 (include) 2025-05-07T19:50:38.1221838Z 2025-05-07T19:50:38.1221843Z 2025-05-07T19:50:38.1222231Z 2025-05-07T19:50:38.1222242Z 2025-05-07T19:50:38.1222648Z ================================================================================ 2025-05-07T19:50:38.1223053Z PyTorch Flags: 2025-05-07T19:50:38.1223284Z 2025-05-07T19:50:38.1223501Z TORCH_INCLUDE_DIRS: 2025-05-07T19:50:38.1223928Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:38.1224722Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so 2025-05-07T19:50:38.1225617Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:38.1226211Z 2025-05-07T19:50:38.1226403Z TORCH_LIBRARIES: 2025-05-07T19:50:38.1226644Z torch 2025-05-07T19:50:38.1226846Z torch_library 2025-05-07T19:50:38.1227295Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:38.1227986Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:38.1228966Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:38.1229522Z 2025-05-07T19:50:38.1229724Z TORCH_CUDA_OPTIONS: 2025-05-07T19:50:38.1229990Z --expt-relaxed-constexpr 2025-05-07T19:50:38.1230265Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:38.1230578Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:38.1230876Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:38.1231192Z 
================================================================================ 2025-05-07T19:50:38.1231429Z 2025-05-07T19:50:38.1231476Z 2025-05-07T19:50:38.1231480Z 2025-05-07T19:50:38.1231598Z ================================================================================ 2025-05-07T19:50:38.1231914Z NCCL Flags 2025-05-07T19:50:38.1232176Z 2025-05-07T19:50:38.1232536Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:38.1233538Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:38.1234136Z ================================================================================ 2025-05-07T19:50:38.1234373Z 2025-05-07T19:50:38.1234377Z 2025-05-07T19:50:38.1234380Z 2025-05-07T19:50:38.1234491Z ================================================================================ 2025-05-07T19:50:38.1234796Z CUDA Driver Path 2025-05-07T19:50:38.1234942Z 2025-05-07T19:50:38.1235278Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:38.1235841Z ================================================================================ 2025-05-07T19:50:38.1236054Z 2025-05-07T19:50:38.1236328Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:38.1253909Z 2025-05-07T19:50:38.1253991Z 2025-05-07T19:50:38.1254582Z ================================================================================ 2025-05-07T19:50:38.1255738Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:50:38.1256603Z 2025-05-07T19:50:38.1257148Z CPU_SRCS: 2025-05-07T19:50:38.1257484Z 2025-05-07T19:50:38.1257698Z 2025-05-07T19:50:38.1258213Z GPU_SRCS: 2025-05-07T19:50:38.1258527Z 2025-05-07T19:50:38.1258733Z 2025-05-07T19:50:38.1259576Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:38.1260154Z 2025-05-07T19:50:38.1260234Z 2025-05-07T19:50:38.1260562Z 
HIP_SPECIFIC_SRCS: 2025-05-07T19:50:38.1260719Z 2025-05-07T19:50:38.1260798Z 2025-05-07T19:50:38.1260987Z OTHER_SRCS: 2025-05-07T19:50:38.1261471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:38.1262097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:38.1262703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:38.1263330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:38.1263947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:38.1264561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:38.1265143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:38.1265753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:38.1266349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:38.1266935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:38.1267547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:38.1268150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:38.1268762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:38.1269371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:38.1270064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:38.1270690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:38.1271286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:38.1271901Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:38.1272491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:38.1273104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:38.1273712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:38.1274305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:38.1274929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:38.1275540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:38.1276340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:38.1276916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:38.1277537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:38.1278164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:38.1278729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:38.1279301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:38.1279892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:38.1280515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:38.1281116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:38.1281806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:38.1282495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:38.1283022Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:38.1283709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:38.1284233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:38.1284777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:38.1285321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:38.1285845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:38.1286382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:38.1286904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:38.1287442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:38.1287969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:38.1288521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:38.1289080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:38.1289624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:38.1290185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:38.1290744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:38.1291310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:38.1291873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:38.1292520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:38.1293103Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:38.1293649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:38.1294209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:38.1294750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:38.1295307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:38.1295865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:38.1296259Z 2025-05-07T19:50:38.1296452Z CC_FLAGS: 2025-05-07T19:50:38.1296568Z 2025-05-07T19:50:38.1296646Z 2025-05-07T19:50:38.1296836Z NVCC_FLAGS: 2025-05-07T19:50:38.1296950Z 2025-05-07T19:50:38.1297024Z 2025-05-07T19:50:38.1297219Z HIPCC_FLAGS: 2025-05-07T19:50:38.1297339Z 2025-05-07T19:50:38.1297413Z 2025-05-07T19:50:38.1297608Z INCLUDE_DIRS: 2025-05-07T19:50:38.1297850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:38.1298144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:38.1298422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:38.1298708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:38.1299192Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:38.1299915Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:38.1300611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:38.1301209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:38.1301690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:38.1302177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:38.1302697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:38.1303178Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:38.1303728Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:38.1304240Z 2025-05-07T19:50:38.1304517Z Selected Source Files: 2025-05-07T19:50:38.1304691Z 2025-05-07T19:50:38.1304775Z 2025-05-07T19:50:38.1304993Z HIPified Source Files: 2025-05-07T19:50:38.1305149Z 2025-05-07T19:50:38.1305232Z 2025-05-07T19:50:38.1305451Z Library Dependencies: 2025-05-07T19:50:38.1305687Z torch 2025-05-07T19:50:38.1305902Z torch_library 2025-05-07T19:50:38.1306331Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:38.1307013Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:38.1307701Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:38.1308506Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:38.1309254Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:38.1309853Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:38.1310273Z 2025-05-07T19:50:38.1310465Z Output Library: 2025-05-07T19:50:38.1310703Z asmjit 2025-05-07T19:50:38.1310890Z 2025-05-07T19:50:38.1311103Z Destination Directory: 2025-05-07T19:50:38.1311340Z fbgemm_gpu 2025-05-07T19:50:38.1311591Z ================================================================================ 2025-05-07T19:50:38.1311826Z 2025-05-07T19:50:38.1311886Z 2025-05-07T19:50:38.1311889Z 2025-05-07T19:50:38.1312009Z ================================================================================ 2025-05-07T19:50:38.1312373Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:50:38.1312665Z 2025-05-07T19:50:38.1312873Z CPU_SRCS: 2025-05-07T19:50:38.1312990Z 2025-05-07T19:50:38.1313068Z 2025-05-07T19:50:38.1313383Z 
GPU_SRCS: 2025-05-07T19:50:38.1314880Z 2025-05-07T19:50:38.1314969Z 2025-05-07T19:50:38.1315171Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:38.1315305Z 2025-05-07T19:50:38.1315379Z 2025-05-07T19:50:38.1315579Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:38.1315714Z 2025-05-07T19:50:38.1315797Z 2025-05-07T19:50:38.1315986Z OTHER_SRCS: 2025-05-07T19:50:38.1316251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:50:38.1316678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:38.1317130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:50:38.1317534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:50:38.1317920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:50:38.1318387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:38.1318814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:50:38.1319183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:50:38.1319557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:38.1319981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:38.1320393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:38.1320795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:50:38.1321219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:38.1321564Z 2025-05-07T19:50:38.1321766Z CC_FLAGS: 2025-05-07T19:50:38.1321878Z 2025-05-07T19:50:38.1321957Z 2025-05-07T19:50:38.1322160Z NVCC_FLAGS: 2025-05-07T19:50:38.1322275Z 2025-05-07T19:50:38.1322355Z 2025-05-07T19:50:38.1322550Z HIPCC_FLAGS: 2025-05-07T19:50:38.1322668Z 2025-05-07T19:50:38.1322764Z 2025-05-07T19:50:38.1322944Z INCLUDE_DIRS: 2025-05-07T19:50:38.1323192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:38.1323488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:38.1323770Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:38.1324060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:38.1324531Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:38.1325249Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:38.1325972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:38.1326367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:38.1326759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:38.1327208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:38.1327685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:38.1328131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:38.1328641Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:38.1329116Z 2025-05-07T19:50:38.1329304Z Selected Source Files: 2025-05-07T19:50:38.1329468Z 2025-05-07T19:50:38.1329544Z 2025-05-07T19:50:38.1329747Z HIPified Source Files: 2025-05-07T19:50:38.1329892Z 2025-05-07T19:50:38.1329969Z 2025-05-07T19:50:38.1330170Z Library Dependencies: 2025-05-07T19:50:38.1330390Z torch 2025-05-07T19:50:38.1330599Z torch_library 2025-05-07T19:50:38.1330992Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:38.1331639Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:38.1332275Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:38.1333019Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:38.1333713Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 
2025-05-07T19:50:38.1334145Z asmjit 2025-05-07T19:50:38.1334464Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:38.1334900Z 2025-05-07T19:50:38.1335098Z Output Library: 2025-05-07T19:50:38.1335299Z fbgemm 2025-05-07T19:50:38.1335494Z 2025-05-07T19:50:38.1335683Z Destination Directory: 2025-05-07T19:50:38.1335931Z fbgemm_gpu 2025-05-07T19:50:38.1336179Z ================================================================================ 2025-05-07T19:50:38.1336402Z 2025-05-07T19:50:38.1336406Z 2025-05-07T19:50:38.1336409Z 2025-05-07T19:50:38.1336524Z ================================================================================ 2025-05-07T19:50:38.1336857Z Running code generation script ... 2025-05-07T19:50:38.1337548Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:50:38.1338276Z ================================================================================ 2025-05-07T19:50:38.1338489Z 2025-05-07T19:50:38.7729174Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:38.7731668Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:50:38.7732480Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:38.7732946Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:38.7733416Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.7733898Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:38.7734370Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:38.7734825Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:38.7735307Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:38.7735768Z 
Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.7736275Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:38.7736758Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:38.7737228Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.7737728Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.7738478Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.7739026Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.7739530Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.7740055Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.7740693Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.7741416Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.7742015Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.7742574Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.7743105Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:50:38.7743539Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:38.7743945Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:38.7744396Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:38.7744901Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.7745435Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:38.7745923Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:38.7746455Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 
2025-05-07T19:50:38.7747095Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:38.7747590Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.7748243Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.7748772Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.7749299Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.7749824Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.7750377Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.7750865Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:50:38.7751292Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:38.7751677Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:38.7752106Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.7752516Z Written: lookup_adagrad.py 2025-05-07T19:50:38.7752820Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:38.7753219Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:38.7753636Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.7754099Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:38.7754543Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:38.7754989Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.7755466Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:38.7755906Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:38.7756356Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:38.7756789Z Written: 
gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:38.7757253Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.7757739Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:38.7758195Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:38.7758675Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.7759146Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.7759718Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.7760235Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.7760745Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.7761241Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.7761720Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.7762233Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.7762748Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.7763262Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.7763722Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:50:38.7764128Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:38.7764493Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:38.7764902Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.7765286Z Written: lookup_adam.py 2025-05-07T19:50:38.7765564Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:38.7765980Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.7766409Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 
2025-05-07T19:50:38.7766869Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.7767324Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:38.7767771Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:38.7768238Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.7768763Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:38.7769229Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.7769716Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.7770233Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.7770702Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.7771211Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.7771738Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.7772194Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:50:38.7772596Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:38.7772945Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:38.7773363Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.7773735Z Written: lookup_lamb.py 2025-05-07T19:50:38.7774029Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:38.7774444Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.7774882Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:38.7775368Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.7775849Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:38.7776736Z Written: 
gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:38.7777251Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.7777793Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:38.7778324Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.7778880Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.7779469Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.7780008Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.7780655Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.7781380Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.7781919Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:50:38.7782377Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:38.7782777Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:38.7783258Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.7783676Z Written: lookup_lars_sgd.py 2025-05-07T19:50:38.7784021Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:38.7784475Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.7785026Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:38.7785651Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.7786263Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:38.7786865Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:38.7787473Z Written: 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.7788108Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:38.7788714Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.7789381Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.7790058Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.7790832Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.7791513Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.7792184Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.8934750Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:50:38.8936386Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:38.8937930Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:38.8939558Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.8941193Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:50:38.8942111Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:38.8942678Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.8943325Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:38.8943942Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.8944599Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:38.8945211Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 
2025-05-07T19:50:38.8945873Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.8946534Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:38.8947265Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.8947914Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.8948547Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.8949180Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.8949844Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.8950481Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.8951332Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:50:38.8951851Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:38.8952354Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:38.8952893Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.8953387Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:50:38.8953819Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:38.8954350Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.8954938Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:38.8955476Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:38.8956020Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:38.8956530Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 
2025-05-07T19:50:38.8957085Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.8957669Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:38.8958217Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:38.8958779Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:38.8959306Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:38.8959849Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:38.8960512Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:38.8961075Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:38.8961620Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:50:38.8962132Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:38.8962688Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.8963254Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:38.8963841Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:38.8964410Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:38.8964939Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:50:38.8965469Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:38.8966010Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.8966588Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.8967131Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 
2025-05-07T19:50:38.8967677Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:38.8968247Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.8968836Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:38.8969426Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.8969998Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:38.8970571Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:38.8971114Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:38.8971688Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.8972271Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.8972896Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:38.8973447Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:38.8974013Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.8974631Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:38.8975244Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.8975823Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:38.8976807Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:38.8977409Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:38.8978051Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 
2025-05-07T19:50:38.8978681Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:38.8979334Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:38.8979990Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:38.8980691Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:38.8981343Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:38.8981984Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:38.8982651Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:38.8983394Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:38.8983977Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:38.8984583Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:38.8985180Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:38.8985755Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:50:38.8986294Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:38.8986798Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:38.8987246Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:38.8987750Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.8988209Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:50:38.8988580Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:38.8989050Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:38.8989560Z Written: 
gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.8990021Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:50:38.8990400Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:38.8990877Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:38.8991401Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.8991987Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:38.8992552Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:38.8993182Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:38.8993730Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.8994282Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:38.8994801Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.8995419Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:38.8996096Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:38.8996671Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:38.8997286Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:38.8997943Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:38.8998585Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:38.8999272Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:50:38.8999960Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:38.9000563Z Written: 
gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:38.9001258Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.0300194Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:39.0301140Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.0301916Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:39.0302602Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:39.0303283Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:39.0304019Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:39.0304930Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:39.0305650Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:39.0306360Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:39.0307179Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:39.0308005Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:39.0308648Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:39.0309331Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:39.0310025Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:39.0310704Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:39.0311442Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:39.0312121Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:39.0312834Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:39.0313504Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:39.0314214Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:39.0314942Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:39.0315618Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:39.0316265Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:39.0316839Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:39.0317392Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:39.0317997Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.0318626Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:50:39.0319093Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:39.0319673Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.0320342Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:50:39.0320952Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:39.0321546Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:39.0322211Z Written: 
gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.0322843Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:39.0323493Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.0324129Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:50:39.0324708Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:39.0325232Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:39.0325790Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.0326374Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:39.0326926Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.0327478Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:39.0327996Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:39.0328479Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:39.0328966Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:39.0329432Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:39.0329917Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:39.0330372Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:39.0330854Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:39.0331337Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:39.0331822Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:39.0332309Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 
2025-05-07T19:50:39.0332788Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:39.0333310Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:39.0333829Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:39.0334346Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:39.0334839Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:39.0335350Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:39.0335875Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:39.0336401Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:39.0336929Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:39.0337393Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:50:39.0337818Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:39.0338177Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:39.0338617Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.0339016Z Written: lookup_sgd.py 2025-05-07T19:50:39.0339329Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:39.0339719Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:39.0340198Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.0340965Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:50:39.0341544Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:39.0341971Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:39.0342481Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.0342971Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 
2025-05-07T19:50:39.0343477Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.0343974Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:39.0344499Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:39.0344996Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:39.0345497Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:39.0346022Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:39.0346533Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:39.0347165Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:39.0347667Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:39.0348207Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:39.0348690Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:39.0349223Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:39.0349845Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:39.0350315Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:50:39.0350738Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:50:39.0351099Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:39.0351540Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.0351919Z Written: lookup_none.py 2025-05-07T19:50:39.0352236Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:39.0352676Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.0353147Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:39.0353687Z Written: 
gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:39.0354212Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:39.0354714Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:39.0355192Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:50:39.0355674Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:50:39.0356145Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:50:39.0356625Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:39.0357154Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:50:39.0357652Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:39.0358162Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:50:39.0358636Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:50:39.0359115Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:50:39.0359589Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:39.0360024Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:39.0360503Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:39.0360975Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:39.0361468Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:39.0361915Z Written: pt2_arg_utils.h 2025-05-07T19:50:39.0362180Z Written: __init__.py 2025-05-07T19:50:39.0362416Z Written: lookup_args_ssd.py 2025-05-07T19:50:39.0362682Z Written: lookup_args.py 2025-05-07T19:50:39.0397810Z 2025-05-07T19:50:39.0397908Z 2025-05-07T19:50:39.0398451Z ================================================================================ 
2025-05-07T19:50:39.0399533Z Running code generation script ... 2025-05-07T19:50:39.0400690Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:50:39.0401512Z ================================================================================ 2025-05-07T19:50:39.0401763Z 2025-05-07T19:50:39.1488191Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:39.1489087Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:50:39.1489840Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:39.1490342Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:39.1490827Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:39.1491322Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:50:39.1491828Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:50:39.1492190Z Written: optimizer_args.py 2025-05-07T19:50:39.1563409Z 2025-05-07T19:50:39.1563513Z 2025-05-07T19:50:39.1564069Z ================================================================================ 2025-05-07T19:50:39.1565577Z Running code generation script ... 
2025-05-07T19:50:39.1567851Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:50:39.1570181Z ================================================================================ 2025-05-07T19:50:39.1570561Z 2025-05-07T19:50:39.2834125Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:39.2836749Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:50:39.2839253Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:39.2840468Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:39.2841103Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:39.2841752Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:39.2842370Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:39.2843003Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:39.2843661Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:39.2844352Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:39.2845017Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:39.2845699Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:39.2846385Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:39.2847057Z Written: 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:39.2847729Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:39.2848362Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:39.2849264Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:39.2849912Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:39.2850545Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:39.2851194Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:39.2851800Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:39.2852415Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:39.2853042Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:39.2853569Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:39.2854049Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:39.2907630Z 2025-05-07T19:50:39.2907735Z 2025-05-07T19:50:39.2908255Z ================================================================================ 2025-05-07T19:50:39.2909357Z Running code generation script ... 
2025-05-07T19:50:39.2911585Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:50:39.2912965Z ================================================================================ 2025-05-07T19:50:39.2913205Z 2025-05-07T19:50:39.7043936Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:39.7046860Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:50:39.7048983Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:39.7050404Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:39.7051290Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:39.7051766Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:39.7052230Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:39.7052676Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:39.7053135Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:39.7053565Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:39.7054032Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:39.7054495Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:39.7054967Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:39.7055420Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:39.7055885Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:39.7056381Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:39.7072634Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 
2025-05-07T19:50:39.7073412Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:39.7073910Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:39.7074369Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:39.7074850Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:39.7075312Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:39.7075778Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:39.7076682Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:39.7077186Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:39.7077672Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:39.7078375Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:39.7078908Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:39.7079410Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:39.7079911Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:39.7080382Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:39.7080842Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:39.7081309Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:39.7081786Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:39.7082265Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:39.7082699Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:39.7083163Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:39.7083595Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:39.7084029Z Written: 
gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:39.7084490Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:39.7084961Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:39.7085443Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:39.7085896Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:39.7086350Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:39.7086773Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:39.7087233Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:39.7087799Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:39.7088269Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:39.7088844Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:39.7089265Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:39.7089675Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:39.7090119Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:39.7090619Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:39.7091094Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:39.7091575Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:39.7092031Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.7092430Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:39.7092838Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:39.7134443Z 2025-05-07T19:50:39.7134567Z 2025-05-07T19:50:39.7135127Z ================================================================================ 2025-05-07T19:50:39.7136225Z Running code 
generation script ... 2025-05-07T19:50:39.7138431Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:50:39.7140955Z ================================================================================ 2025-05-07T19:50:39.7141655Z 2025-05-07T19:50:40.0087017Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:50:40.0089443Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:50:40.0091483Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:40.0092698Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:40.0093089Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:40.0093527Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:40.0094180Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:40.0094611Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:40.0095079Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:50:40.0095565Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:40.0095989Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:50:40.0168825Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:50:40.0180020Z 2025-05-07T19:50:40.0180813Z 2025-05-07T19:50:40.0181223Z ================================================================================ 2025-05-07T19:50:40.0181673Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:50:40.0182054Z 2025-05-07T19:50:40.0182247Z CPU_SRCS: 2025-05-07T19:50:40.0182686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:40.0183363Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:40.0184017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:40.0184640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:40.0185242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:40.0185733Z 2025-05-07T19:50:40.0185935Z GPU_SRCS: 2025-05-07T19:50:40.0186275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:40.0186874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:40.0187495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:40.0188349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:40.0188962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:40.0189571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:40.0190222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:40.0190927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:40.0191512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:40.0192240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:40.0192695Z 2025-05-07T19:50:40.0192882Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.0193035Z 2025-05-07T19:50:40.0193113Z 2025-05-07T19:50:40.0193319Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.0193452Z 2025-05-07T19:50:40.0193529Z 2025-05-07T19:50:40.0193731Z OTHER_SRCS: 2025-05-07T19:50:40.0193840Z 2025-05-07T19:50:40.0193911Z 2025-05-07T19:50:40.0194097Z CC_FLAGS: 2025-05-07T19:50:40.0194206Z 
2025-05-07T19:50:40.0194279Z 2025-05-07T19:50:40.0194458Z NVCC_FLAGS: 2025-05-07T19:50:40.0194657Z --expt-relaxed-constexpr 2025-05-07T19:50:40.0194911Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.0195171Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.0195456Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.0195701Z 2025-05-07T19:50:40.0195869Z HIPCC_FLAGS: 2025-05-07T19:50:40.0195987Z 2025-05-07T19:50:40.0196071Z 2025-05-07T19:50:40.0196241Z INCLUDE_DIRS: 2025-05-07T19:50:40.0196465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.0196751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.0197032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.0197313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.0197783Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.0198520Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.0199116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.0199511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.0200051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.0200493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.0200984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.0201409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.0201926Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.0202379Z 2025-05-07T19:50:40.0202575Z Selected Source Files: 2025-05-07T19:50:40.0202965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:40.0203582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:40.0204174Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:40.0204744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:40.0205318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:40.0205894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:50:40.0206452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:50:40.0207022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:50:40.0207628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:50:40.0208201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:50:40.0208806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:50:40.0209390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:50:40.0209929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:50:40.0210479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:50:40.0211073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:50:40.0211523Z 2025-05-07T19:50:40.0211723Z HIPified Source Files: 2025-05-07T19:50:40.0211870Z 2025-05-07T19:50:40.0211945Z 2025-05-07T19:50:40.0212147Z Library Dependencies: 2025-05-07T19:50:40.0212362Z torch 2025-05-07T19:50:40.0212559Z torch_library 2025-05-07T19:50:40.0212958Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.0213590Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.0214233Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.0214970Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.0215655Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.0216207Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.0216589Z 2025-05-07T19:50:40.0216764Z Output Library: 2025-05-07T19:50:40.0216988Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:40.0217196Z 2025-05-07T19:50:40.0217391Z Destination Directory: 2025-05-07T19:50:40.0217606Z fbgemm_gpu 2025-05-07T19:50:40.0217840Z ================================================================================ 2025-05-07T19:50:40.0218056Z 2025-05-07T19:50:40.0679790Z 2025-05-07T19:50:40.0679898Z 2025-05-07T19:50:40.0680443Z ================================================================================ 2025-05-07T19:50:40.0681215Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:50:40.0681608Z 2025-05-07T19:50:40.0681841Z CPU_SRCS: 2025-05-07T19:50:40.0682160Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:40.0682644Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:40.0683330Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:40.0683702Z 2025-05-07T19:50:40.0683897Z GPU_SRCS: 2025-05-07T19:50:40.0684185Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:40.0684641Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:40.0685201Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:40.0685806Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:40.0686418Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:40.0687019Z 
gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:40.0687752Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:40.0688349Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:40.0689063Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:40.0689711Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:40.0690518Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:40.0691146Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:40.0691789Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:40.0692424Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:40.0693060Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:40.0693802Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:40.0694404Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:40.0695020Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:40.0695619Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:40.0696233Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:40.0696819Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:40.0697381Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:40.0698061Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 
2025-05-07T19:50:40.0698445Z 2025-05-07T19:50:40.0698639Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.0698770Z 2025-05-07T19:50:40.0698844Z 2025-05-07T19:50:40.0699043Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.0699172Z 2025-05-07T19:50:40.0699244Z 2025-05-07T19:50:40.0699423Z OTHER_SRCS: 2025-05-07T19:50:40.0699534Z 2025-05-07T19:50:40.0699625Z 2025-05-07T19:50:40.0699785Z CC_FLAGS: 2025-05-07T19:50:40.0699891Z 2025-05-07T19:50:40.0699976Z 2025-05-07T19:50:40.0700138Z NVCC_FLAGS: 2025-05-07T19:50:40.0700457Z --expt-relaxed-constexpr 2025-05-07T19:50:40.0700714Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.0701186Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.0701538Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.0701798Z 2025-05-07T19:50:40.0701985Z HIPCC_FLAGS: 2025-05-07T19:50:40.0702122Z 2025-05-07T19:50:40.0702197Z 2025-05-07T19:50:40.0702373Z INCLUDE_DIRS: 2025-05-07T19:50:40.0702613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.0702931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.0703204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.0703515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.0704001Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.0704786Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.0705506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.0705927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.0706360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.0706822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.0707346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.0707797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.0708351Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.0708844Z 2025-05-07T19:50:40.0709048Z Selected Source Files: 2025-05-07T19:50:40.0709376Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:50:40.0709834Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:40.0710277Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:40.0710699Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:50:40.0711163Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:50:40.0711699Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:50:40.0712302Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:50:40.0712889Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:50:40.0713571Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:50:40.0714130Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:50:40.0714743Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:50:40.0715330Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:50:40.0715934Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:50:40.0716551Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:50:40.0717162Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:50:40.0717769Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:50:40.0718377Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:50:40.0718967Z 
gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:50:40.0719536Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:50:40.0720100Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:50:40.0720674Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:50:40.0721241Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:50:40.0721805Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:50:40.0722346Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:50:40.0722873Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:50:40.0723419Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:50:40.0723794Z 2025-05-07T19:50:40.0723977Z HIPified Source Files: 2025-05-07T19:50:40.0724112Z 2025-05-07T19:50:40.0724192Z 2025-05-07T19:50:40.0724371Z Library Dependencies: 2025-05-07T19:50:40.0724585Z torch 2025-05-07T19:50:40.0724757Z torch_library 2025-05-07T19:50:40.0725160Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.0725773Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.0726422Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.0727196Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.0727868Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.0728299Z asmjit 2025-05-07T19:50:40.0728469Z fbgemm 2025-05-07T19:50:40.0728658Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:40.0728865Z fbgemm_gpu_config 
2025-05-07T19:50:40.0729191Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.0729556Z 2025-05-07T19:50:40.0729733Z Output Library: 2025-05-07T19:50:40.0729941Z fbgemm_gpu_tbe_inference 2025-05-07T19:50:40.0730175Z 2025-05-07T19:50:40.0730350Z Destination Directory: 2025-05-07T19:50:40.0730571Z fbgemm_gpu 2025-05-07T19:50:40.0730796Z ================================================================================ 2025-05-07T19:50:40.0731007Z 2025-05-07T19:50:40.3036197Z 2025-05-07T19:50:40.3036216Z 2025-05-07T19:50:40.3036880Z ================================================================================ 2025-05-07T19:50:40.3038049Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:50:40.3038976Z 2025-05-07T19:50:40.3039470Z CPU_SRCS: 2025-05-07T19:50:40.3040077Z src/config/feature_gates.cpp 2025-05-07T19:50:40.3040768Z 2025-05-07T19:50:40.3041266Z GPU_SRCS: 2025-05-07T19:50:40.3041565Z 2025-05-07T19:50:40.3041635Z 2025-05-07T19:50:40.3041819Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3041951Z 2025-05-07T19:50:40.3042021Z 2025-05-07T19:50:40.3042206Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3042337Z 2025-05-07T19:50:40.3042422Z 2025-05-07T19:50:40.3042588Z OTHER_SRCS: 2025-05-07T19:50:40.3042872Z 2025-05-07T19:50:40.3042960Z 2025-05-07T19:50:40.3043257Z CC_FLAGS: 2025-05-07T19:50:40.3043764Z 2025-05-07T19:50:40.3043865Z 2025-05-07T19:50:40.3044040Z NVCC_FLAGS: 2025-05-07T19:50:40.3044167Z 2025-05-07T19:50:40.3044241Z 2025-05-07T19:50:40.3044420Z HIPCC_FLAGS: 2025-05-07T19:50:40.3044552Z 2025-05-07T19:50:40.3044636Z 2025-05-07T19:50:40.3044811Z INCLUDE_DIRS: 2025-05-07T19:50:40.3045033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3045347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3045621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3045931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3046415Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3047197Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3047829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3048249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3048673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3049140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3049697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3050153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3050706Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3051201Z 2025-05-07T19:50:40.3051394Z Selected Source Files: 2025-05-07T19:50:40.3051660Z src/config/feature_gates.cpp 2025-05-07T19:50:40.3051905Z 2025-05-07T19:50:40.3052102Z HIPified Source Files: 2025-05-07T19:50:40.3052249Z 2025-05-07T19:50:40.3052321Z 2025-05-07T19:50:40.3052513Z Library Dependencies: 2025-05-07T19:50:40.3052731Z torch 2025-05-07T19:50:40.3052923Z torch_library 2025-05-07T19:50:40.3053451Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3054220Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3054857Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3055776Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3056798Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3057388Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3057789Z 
2025-05-07T19:50:40.3057967Z Output Library: 2025-05-07T19:50:40.3058197Z fbgemm_gpu_config 2025-05-07T19:50:40.3058406Z 2025-05-07T19:50:40.3058598Z Destination Directory: 2025-05-07T19:50:40.3058836Z fbgemm_gpu 2025-05-07T19:50:40.3059064Z ================================================================================ 2025-05-07T19:50:40.3059297Z 2025-05-07T19:50:40.3059312Z 2025-05-07T19:50:40.3059316Z 2025-05-07T19:50:40.3059435Z ================================================================================ 2025-05-07T19:50:40.3059798Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:50:40.3060134Z 2025-05-07T19:50:40.3060433Z CPU_SRCS: 2025-05-07T19:50:40.3060728Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:40.3061198Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:40.3061545Z 2025-05-07T19:50:40.3061735Z GPU_SRCS: 2025-05-07T19:50:40.3061989Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:40.3062401Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:40.3062772Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:40.3063145Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:40.3063524Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:40.3063859Z 2025-05-07T19:50:40.3064046Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3064177Z 2025-05-07T19:50:40.3064251Z 2025-05-07T19:50:40.3064513Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3064648Z 2025-05-07T19:50:40.3064723Z 2025-05-07T19:50:40.3064905Z OTHER_SRCS: 2025-05-07T19:50:40.3065019Z 2025-05-07T19:50:40.3065096Z 2025-05-07T19:50:40.3065284Z CC_FLAGS: 2025-05-07T19:50:40.3065402Z 2025-05-07T19:50:40.3065478Z 2025-05-07T19:50:40.3065675Z NVCC_FLAGS: 2025-05-07T19:50:40.3065791Z 2025-05-07T19:50:40.3065882Z 2025-05-07T19:50:40.3066055Z HIPCC_FLAGS: 2025-05-07T19:50:40.3066175Z 2025-05-07T19:50:40.3066266Z 2025-05-07T19:50:40.3066444Z INCLUDE_DIRS: 
2025-05-07T19:50:40.3066682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3067167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3067452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3067925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3068412Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3069189Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3069832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3070241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3070659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3071135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3071638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3072099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3072646Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3073133Z 2025-05-07T19:50:40.3073332Z Selected Source Files: 2025-05-07T19:50:40.3073765Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:40.3074199Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:50:40.3074608Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:40.3075002Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:50:40.3075372Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:50:40.3075737Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:50:40.3076674Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:50:40.3077015Z 2025-05-07T19:50:40.3077224Z HIPified Source Files: 2025-05-07T19:50:40.3077376Z 2025-05-07T19:50:40.3077450Z 2025-05-07T19:50:40.3077657Z 
Library Dependencies: 2025-05-07T19:50:40.3077880Z torch 2025-05-07T19:50:40.3078084Z torch_library 2025-05-07T19:50:40.3078500Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3079169Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3079860Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3080655Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3081382Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3081969Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3082481Z 2025-05-07T19:50:40.3082655Z Output Library: 2025-05-07T19:50:40.3082875Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:40.3083076Z 2025-05-07T19:50:40.3083273Z Destination Directory: 2025-05-07T19:50:40.3083509Z fbgemm_gpu 2025-05-07T19:50:40.3083728Z ================================================================================ 2025-05-07T19:50:40.3083954Z 2025-05-07T19:50:40.3084054Z 2025-05-07T19:50:40.3084058Z 2025-05-07T19:50:40.3084167Z ================================================================================ 2025-05-07T19:50:40.3084571Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:50:40.3084923Z 2025-05-07T19:50:40.3085106Z CPU_SRCS: 2025-05-07T19:50:40.3085317Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:40.3085707Z 2025-05-07T19:50:40.3085880Z GPU_SRCS: 2025-05-07T19:50:40.3086084Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:40.3086350Z 2025-05-07T19:50:40.3086526Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3086660Z 2025-05-07T19:50:40.3086731Z 2025-05-07T19:50:40.3086910Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3087039Z 2025-05-07T19:50:40.3087108Z 2025-05-07T19:50:40.3087275Z OTHER_SRCS: 
2025-05-07T19:50:40.3087382Z 2025-05-07T19:50:40.3087450Z 2025-05-07T19:50:40.3087616Z CC_FLAGS: 2025-05-07T19:50:40.3087721Z 2025-05-07T19:50:40.3087793Z 2025-05-07T19:50:40.3087958Z NVCC_FLAGS: 2025-05-07T19:50:40.3088064Z 2025-05-07T19:50:40.3088135Z 2025-05-07T19:50:40.3088307Z HIPCC_FLAGS: 2025-05-07T19:50:40.3088418Z 2025-05-07T19:50:40.3088495Z 2025-05-07T19:50:40.3088660Z INCLUDE_DIRS: 2025-05-07T19:50:40.3088875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3089160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3089429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3089713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3090171Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3090906Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3091518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3091906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3092299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3092749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3093234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3093667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3094182Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3094649Z 2025-05-07T19:50:40.3094835Z Selected Source Files: 2025-05-07T19:50:40.3095073Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:50:40.3095370Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:50:40.3095625Z 2025-05-07T19:50:40.3095887Z HIPified Source Files: 2025-05-07T19:50:40.3096029Z 2025-05-07T19:50:40.3096098Z 2025-05-07T19:50:40.3096278Z Library Dependencies: 2025-05-07T19:50:40.3096485Z 
torch 2025-05-07T19:50:40.3096666Z torch_library 2025-05-07T19:50:40.3097061Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3097704Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3098362Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3099105Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3099815Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3100262Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:40.3100700Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3101268Z 2025-05-07T19:50:40.3101458Z Output Library: 2025-05-07T19:50:40.3101692Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:40.3101948Z 2025-05-07T19:50:40.3102142Z Destination Directory: 2025-05-07T19:50:40.3102364Z fbgemm_gpu 2025-05-07T19:50:40.3102589Z ================================================================================ 2025-05-07T19:50:40.3102814Z 2025-05-07T19:50:40.3102818Z 2025-05-07T19:50:40.3102822Z 2025-05-07T19:50:40.3102932Z ================================================================================ 2025-05-07T19:50:40.3103299Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:50:40.3103619Z 2025-05-07T19:50:40.3103797Z CPU_SRCS: 2025-05-07T19:50:40.3104044Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:40.3104509Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:40.3104896Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:40.3105180Z 2025-05-07T19:50:40.3105352Z GPU_SRCS: 2025-05-07T19:50:40.3105568Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:40.3105896Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:40.3106223Z 
codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:40.3106520Z 2025-05-07T19:50:40.3106695Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3106831Z 2025-05-07T19:50:40.3107027Z 2025-05-07T19:50:40.3107210Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3107337Z 2025-05-07T19:50:40.3107408Z 2025-05-07T19:50:40.3107578Z OTHER_SRCS: 2025-05-07T19:50:40.3107684Z 2025-05-07T19:50:40.3107752Z 2025-05-07T19:50:40.3107917Z CC_FLAGS: 2025-05-07T19:50:40.3108017Z 2025-05-07T19:50:40.3108087Z 2025-05-07T19:50:40.3108256Z NVCC_FLAGS: 2025-05-07T19:50:40.3108457Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3108707Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3108966Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3109248Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3109490Z 2025-05-07T19:50:40.3109657Z HIPCC_FLAGS: 2025-05-07T19:50:40.3109777Z 2025-05-07T19:50:40.3109856Z 2025-05-07T19:50:40.3110030Z INCLUDE_DIRS: 2025-05-07T19:50:40.3110253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3110539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3110806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3111089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3111554Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3112308Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3112920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3113319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3113755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3114252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3114770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3115328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
2025-05-07T19:50:40.3115901Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3116406Z 2025-05-07T19:50:40.3116651Z Selected Source Files: 2025-05-07T19:50:40.3116959Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:50:40.3117520Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:50:40.3117899Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:50:40.3118266Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:50:40.3118603Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:50:40.3118938Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:50:40.3119219Z 2025-05-07T19:50:40.3119443Z HIPified Source Files: 2025-05-07T19:50:40.3119590Z 2025-05-07T19:50:40.3119694Z 2025-05-07T19:50:40.3119893Z Library Dependencies: 2025-05-07T19:50:40.3120164Z torch 2025-05-07T19:50:40.3120365Z torch_library 2025-05-07T19:50:40.3120818Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3121467Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3122161Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3122907Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3123635Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3124278Z fbgemm 2025-05-07T19:50:40.3124538Z fbgemm_gpu_config 2025-05-07T19:50:40.3124908Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3125303Z 2025-05-07T19:50:40.3125592Z Output Library: 2025-05-07T19:50:40.3125810Z fbgemm_gpu_tbe_common 2025-05-07T19:50:40.3126044Z 2025-05-07T19:50:40.3126235Z Destination Directory: 2025-05-07T19:50:40.3126485Z fbgemm_gpu 2025-05-07T19:50:40.3126710Z 
================================================================================ 2025-05-07T19:50:40.3126959Z 2025-05-07T19:50:40.3126963Z 2025-05-07T19:50:40.3126967Z 2025-05-07T19:50:40.3127083Z ================================================================================ 2025-05-07T19:50:40.3127675Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:50:40.3128030Z 2025-05-07T19:50:40.3128250Z CPU_SRCS: 2025-05-07T19:50:40.3128373Z 2025-05-07T19:50:40.3128459Z 2025-05-07T19:50:40.3128683Z GPU_SRCS: 2025-05-07T19:50:40.3128959Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:40.3129395Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:40.3129842Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:40.3130201Z 2025-05-07T19:50:40.3130457Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3130611Z 2025-05-07T19:50:40.3130700Z 2025-05-07T19:50:40.3130935Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3131081Z 2025-05-07T19:50:40.3131184Z 2025-05-07T19:50:40.3131414Z OTHER_SRCS: 2025-05-07T19:50:40.3131542Z 2025-05-07T19:50:40.3131630Z 2025-05-07T19:50:40.3131852Z CC_FLAGS: 2025-05-07T19:50:40.3131975Z 2025-05-07T19:50:40.3132087Z 2025-05-07T19:50:40.3132283Z NVCC_FLAGS: 2025-05-07T19:50:40.3132541Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3132828Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3133146Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3133461Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3133751Z 2025-05-07T19:50:40.3133953Z HIPCC_FLAGS: 2025-05-07T19:50:40.3134113Z 2025-05-07T19:50:40.3134203Z 2025-05-07T19:50:40.3134405Z INCLUDE_DIRS: 2025-05-07T19:50:40.3134678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3135025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3135323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3135673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3136171Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3137111Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3137769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3138218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3138658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3139186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3139741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3140218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3140917Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3141467Z 2025-05-07T19:50:40.3141709Z Selected Source Files: 2025-05-07T19:50:40.3142014Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:50:40.3142446Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:50:40.3142905Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:50:40.3143256Z 2025-05-07T19:50:40.3143493Z HIPified Source Files: 2025-05-07T19:50:40.3143652Z 2025-05-07T19:50:40.3143740Z 2025-05-07T19:50:40.3143975Z Library Dependencies: 2025-05-07T19:50:40.3144215Z torch 2025-05-07T19:50:40.3144443Z torch_library 2025-05-07T19:50:40.3144881Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3145587Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3146285Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3147173Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3147954Z 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3148567Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3149015Z 2025-05-07T19:50:40.3149228Z Output Library: 2025-05-07T19:50:40.3149500Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:40.3149759Z 2025-05-07T19:50:40.3150011Z Destination Directory: 2025-05-07T19:50:40.3150266Z fbgemm_gpu 2025-05-07T19:50:40.3150542Z ================================================================================ 2025-05-07T19:50:40.3150788Z 2025-05-07T19:50:40.3150913Z 2025-05-07T19:50:40.3150917Z 2025-05-07T19:50:40.3151080Z ================================================================================ 2025-05-07T19:50:40.3151516Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:50:40.3151938Z 2025-05-07T19:50:40.3152151Z CPU_SRCS: 2025-05-07T19:50:40.3152430Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3152767Z 2025-05-07T19:50:40.3152965Z GPU_SRCS: 2025-05-07T19:50:40.3153325Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:40.3153661Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:40.3154000Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:40.3154343Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:40.3154737Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:40.3155111Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:40.3155481Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:40.3155819Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:40.3156175Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:40.3156536Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:40.3156903Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:40.3157288Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 
2025-05-07T19:50:40.3157662Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:40.3158050Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:40.3158495Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:40.3158894Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:40.3159284Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:50:40.3159654Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:40.3160026Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:40.3160391Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:40.3160776Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:40.3161155Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3161565Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3161938Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:40.3162300Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:40.3162680Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3163076Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3163491Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:40.3163847Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3164221Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:40.3164570Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:40.3164949Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3165328Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3165678Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:40.3166068Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3166524Z 
gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3166926Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:40.3167277Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:40.3167656Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3168069Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3168475Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:40.3168857Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3169231Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:40.3169609Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:40.3169988Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3170375Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3170742Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:40.3171131Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:40.3171751Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:40.3172179Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:40.3172573Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3173115Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3173423Z 2025-05-07T19:50:40.3173612Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3173764Z 2025-05-07T19:50:40.3173836Z 2025-05-07T19:50:40.3174010Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3174149Z 2025-05-07T19:50:40.3174222Z 2025-05-07T19:50:40.3174393Z OTHER_SRCS: 2025-05-07T19:50:40.3174512Z 2025-05-07T19:50:40.3174582Z 2025-05-07T19:50:40.3174756Z CC_FLAGS: 2025-05-07T19:50:40.3174865Z 2025-05-07T19:50:40.3174936Z 2025-05-07T19:50:40.3175114Z NVCC_FLAGS: 2025-05-07T19:50:40.3175315Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3175580Z 
-D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3175847Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3176329Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3176564Z 2025-05-07T19:50:40.3176759Z HIPCC_FLAGS: 2025-05-07T19:50:40.3176899Z 2025-05-07T19:50:40.3176968Z 2025-05-07T19:50:40.3177151Z INCLUDE_DIRS: 2025-05-07T19:50:40.3177504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3177802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3178076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3178367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3178848Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3179778Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3180492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3180900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3181310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3181775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3182276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3182735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3183274Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3183762Z 2025-05-07T19:50:40.3183949Z Selected Source Files: 2025-05-07T19:50:40.3184229Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3184619Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:50:40.3185014Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:50:40.3185419Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:50:40.3185829Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 
2025-05-07T19:50:40.3186231Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:50:40.3186612Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:50:40.3187116Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3187540Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3187960Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3188421Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:50:40.3188816Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3189184Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3189525Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:50:40.3189880Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:50:40.3190234Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:50:40.3190620Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:50:40.3191051Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:50:40.3191456Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:50:40.3191847Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:50:40.3192213Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:50:40.3192581Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:50:40.3192955Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:50:40.3193372Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:50:40.3193777Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:50:40.3194181Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:50:40.3194584Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:50:40.3194963Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:50:40.3195383Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3195782Z 
gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:50:40.3196172Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:50:40.3196562Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3197025Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3197457Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:50:40.3197850Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3198318Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:50:40.3198697Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:50:40.3199106Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3199473Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:50:40.3199877Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3200279Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:50:40.3200677Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:50:40.3201088Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3201531Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:50:40.3201996Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:50:40.3202402Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3202824Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:50:40.3203233Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:50:40.3203665Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:50:40.3204075Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:50:40.3204482Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:40.3204951Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:40.3205399Z 
gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:50:40.3205762Z 2025-05-07T19:50:40.3205957Z HIPified Source Files: 2025-05-07T19:50:40.3206120Z 2025-05-07T19:50:40.3206197Z 2025-05-07T19:50:40.3206399Z Library Dependencies: 2025-05-07T19:50:40.3206622Z torch 2025-05-07T19:50:40.3206810Z torch_library 2025-05-07T19:50:40.3208243Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3208957Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3209671Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3210477Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3211202Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3211689Z fbgemm_gpu_tbe_common 2025-05-07T19:50:40.3212047Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3212458Z 2025-05-07T19:50:40.3212663Z Output Library: 2025-05-07T19:50:40.3212894Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:50:40.3213166Z 2025-05-07T19:50:40.3213355Z Destination Directory: 2025-05-07T19:50:40.3213606Z fbgemm_gpu 2025-05-07T19:50:40.3213835Z ================================================================================ 2025-05-07T19:50:40.3214084Z 2025-05-07T19:50:40.3214089Z 2025-05-07T19:50:40.3214092Z 2025-05-07T19:50:40.3214211Z ================================================================================ 2025-05-07T19:50:40.3214662Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:50:40.3215048Z 2025-05-07T19:50:40.3215243Z CPU_SRCS: 2025-05-07T19:50:40.3215470Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3215855Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3216208Z 
gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:40.3216544Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:40.3216871Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:40.3217207Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:40.3217591Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:40.3218149Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:40.3218525Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:40.3218905Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:40.3219335Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:40.3219790Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3220274Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:40.3221088Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:40.3221698Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:40.3222206Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3222619Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3223040Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3223493Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3223949Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3224345Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3224744Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3225173Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3225645Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3226174Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3226633Z 
gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3227122Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3227635Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3228137Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3228809Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3229479Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3230134Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3230728Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3231152Z 2025-05-07T19:50:40.3231333Z GPU_SRCS: 2025-05-07T19:50:40.3231625Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3232104Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3232552Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3232961Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3233472Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3233885Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3234370Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3234897Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3235354Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3235851Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3236381Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3236869Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 
2025-05-07T19:50:40.3237455Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3238096Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3238740Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3239333Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3239839Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3240207Z 2025-05-07T19:50:40.3240385Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3240519Z 2025-05-07T19:50:40.3240602Z 2025-05-07T19:50:40.3240777Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3240913Z 2025-05-07T19:50:40.3241073Z 2025-05-07T19:50:40.3241243Z OTHER_SRCS: 2025-05-07T19:50:40.3241367Z 2025-05-07T19:50:40.3241440Z 2025-05-07T19:50:40.3241606Z CC_FLAGS: 2025-05-07T19:50:40.3241729Z 2025-05-07T19:50:40.3241802Z 2025-05-07T19:50:40.3241990Z NVCC_FLAGS: 2025-05-07T19:50:40.3242188Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3242458Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3242724Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3243009Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3243248Z 2025-05-07T19:50:40.3243437Z HIPCC_FLAGS: 2025-05-07T19:50:40.3243556Z 2025-05-07T19:50:40.3243631Z 2025-05-07T19:50:40.3243811Z INCLUDE_DIRS: 2025-05-07T19:50:40.3244030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3244333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3244613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3244901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3245375Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3246125Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3246750Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3247141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3247555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3248012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3248505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3248958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3249538Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3250018Z 2025-05-07T19:50:40.3250200Z Selected Source Files: 2025-05-07T19:50:40.3250472Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3250818Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3251174Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:50:40.3251492Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:50:40.3251810Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:50:40.3252143Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:50:40.3252508Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:50:40.3252922Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:50:40.3253279Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:50:40.3253671Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:40.3254081Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:50:40.3254478Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3254961Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:50:40.3255507Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:40.3256055Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:50:40.3256535Z 
gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3256950Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:50:40.3257335Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3257783Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3258218Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3258603Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3258988Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3259387Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3259846Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3260429Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3261143Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3261633Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3262147Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3262648Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3263232Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3263897Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3264537Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3265140Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:50:40.3265642Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3266099Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3277998Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3278477Z 
gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3278879Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3279308Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3279788Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3280317Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3280781Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3281271Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3281794Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3282456Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3283056Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3283717Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3284377Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3284972Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3285486Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:50:40.3285865Z 2025-05-07T19:50:40.3286048Z HIPified Source Files: 2025-05-07T19:50:40.3286197Z 2025-05-07T19:50:40.3286278Z 2025-05-07T19:50:40.3286456Z Library Dependencies: 2025-05-07T19:50:40.3286681Z torch 2025-05-07T19:50:40.3286860Z torch_library 2025-05-07T19:50:40.3287284Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3287939Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3288622Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:50:40.3289400Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3290241Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3290677Z fbgemm 2025-05-07T19:50:40.3290852Z fbgemm_gpu_config 2025-05-07T19:50:40.3291063Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:40.3291282Z fbgemm_gpu_tbe_common 2025-05-07T19:50:40.3291503Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:40.3291724Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:40.3292096Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3292475Z 2025-05-07T19:50:40.3292642Z Output Library: 2025-05-07T19:50:40.3292864Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:50:40.3293117Z 2025-05-07T19:50:40.3293303Z Destination Directory: 2025-05-07T19:50:40.3293518Z fbgemm_gpu 2025-05-07T19:50:40.3293729Z ================================================================================ 2025-05-07T19:50:40.3294040Z 2025-05-07T19:50:40.3294254Z 2025-05-07T19:50:40.3294259Z 2025-05-07T19:50:40.3294372Z ================================================================================ 2025-05-07T19:50:40.3294763Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:50:40.3295119Z 2025-05-07T19:50:40.3295283Z CPU_SRCS: 2025-05-07T19:50:40.3295585Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:40.3295986Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:40.3296308Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:40.3296653Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:40.3297009Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:40.3297319Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:40.3297625Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:40.3297948Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:50:40.3298310Z 
gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:40.3298722Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:40.3299079Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:40.3299467Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:40.3299878Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:40.3300262Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:40.3301025Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:40.3301592Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:40.3302152Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:40.3302713Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:40.3303119Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:40.3303478Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:40.3303844Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:40.3304131Z 2025-05-07T19:50:40.3304307Z GPU_SRCS: 2025-05-07T19:50:40.3304554Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:40.3304954Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:40.3305414Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:40.3305838Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:40.3306275Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:40.3306728Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:40.3307219Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:40.3307725Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3308244Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 
2025-05-07T19:50:40.3308796Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3309305Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:40.3309786Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3310283Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3310741Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:40.3311160Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3311604Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3312063Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3312537Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3313172Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3313614Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:40.3314044Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3314562Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3315002Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:40.3315475Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3315960Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3316458Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3316971Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3317526Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3318152Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:40.3318609Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 
2025-05-07T19:50:40.3319094Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3319508Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:40.3319867Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3320238Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3320625Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3321044Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3321482Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3321882Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:40.3322243Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3322639Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3323080Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:40.3323449Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3323826Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3324216Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3324634Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3325074Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3325487Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:40.3325858Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3326262Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3326631Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:40.3326998Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3327384Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3327769Z 
gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3328187Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3328807Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3329234Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:40.3329627Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3330046Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3330453Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:40.3330870Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3331311Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3331751Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3332227Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3332725Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3333183Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:40.3333606Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3334129Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3334600Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:40.3335101Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3335638Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3336170Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3336735Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3337337Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3337883Z 
gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:50:40.3338398Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3338941Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3339461Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:40.3339969Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3340585Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3341311Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3341885Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3342491Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3343062Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:40.3343659Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3344216Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3344677Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:40.3345076Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3345481Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3345909Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3346360Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3346849Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3347285Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:40.3347683Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3348117Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 
2025-05-07T19:50:40.3348614Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:40.3349198Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3349794Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3350406Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3351043Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3351704Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3352322Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:40.3352903Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3353629Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3354063Z 2025-05-07T19:50:40.3354236Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3354369Z 2025-05-07T19:50:40.3354451Z 2025-05-07T19:50:40.3354619Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3354939Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:50:40.3355390Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:50:40.3355885Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:50:40.3356221Z 2025-05-07T19:50:40.3356393Z OTHER_SRCS: 2025-05-07T19:50:40.3356498Z 2025-05-07T19:50:40.3356575Z 2025-05-07T19:50:40.3356735Z CC_FLAGS: 2025-05-07T19:50:40.3356841Z 2025-05-07T19:50:40.3356916Z 2025-05-07T19:50:40.3357075Z NVCC_FLAGS: 2025-05-07T19:50:40.3357278Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3357524Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3357789Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3358054Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3358287Z 
2025-05-07T19:50:40.3358449Z HIPCC_FLAGS: 2025-05-07T19:50:40.3358571Z 2025-05-07T19:50:40.3358641Z 2025-05-07T19:50:40.3358802Z INCLUDE_DIRS: 2025-05-07T19:50:40.3359021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3359313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3359566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3359852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3360312Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3361149Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3361723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3362091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3362474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3362894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3363359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3363818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3364099Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3364166Z 2025-05-07T19:50:40.3364248Z Selected Source Files: 2025-05-07T19:50:40.3364437Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:50:40.3364550Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:50:40.3364662Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:50:40.3364792Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:40.3364900Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:50:40.3364999Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:50:40.3365101Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:50:40.3365210Z gen_embedding_backward_split_lars_sgd_cpu.cpp 
2025-05-07T19:50:40.3365366Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:50:40.3365511Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:50:40.3365615Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:50:40.3365797Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:40.3365915Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:50:40.3366073Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:50:40.3366279Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:50:40.3366496Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:40.3366691Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:50:40.3366850Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:50:40.3366963Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:50:40.3367093Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:50:40.3367194Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:50:40.3367325Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:50:40.3367479Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:50:40.3367630Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:50:40.3367779Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:50:40.3367983Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:50:40.3368157Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:50:40.3368332Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:50:40.3368516Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3368716Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3368919Z 
gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3369085Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:50:40.3369267Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3369450Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3369590Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:40.3369743Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3369902Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3370064Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3370252Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3370436Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3370571Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:40.3370739Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3370901Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3371059Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:50:40.3371364Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3371551Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3371737Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3371956Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3372167Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3372334Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:50:40.3372522Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3372723Z 
gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3372844Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:50:40.3372982Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3373135Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3373278Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3373443Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3373622Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3373750Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:40.3373892Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3374041Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3374169Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:50:40.3374310Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3374452Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3374605Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3374775Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3374952Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3375087Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:50:40.3375232Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3375437Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3375556Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:50:40.3375701Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3375844Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3376147Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 
2025-05-07T19:50:40.3376592Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3376778Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3376919Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:40.3377085Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3377252Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3377396Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:50:40.3377564Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3377750Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3377927Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3378128Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3378338Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3378490Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:50:40.3378664Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3378852Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3379040Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:50:40.3379524Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3379745Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3379971Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3380221Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3380557Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3380767Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 
2025-05-07T19:50:40.3380984Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3381207Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3381401Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:50:40.3381614Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3381833Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3382055Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3382296Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3382542Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3382740Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:50:40.3382963Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3383184Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3383319Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:50:40.3383479Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3383633Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3383794Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3383982Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3384169Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3384383Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:50:40.3384542Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3384709Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3384925Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:50:40.3385166Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3385414Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3385659Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3385937Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3386218Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3386443Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:50:40.3386694Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3386950Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3387023Z 2025-05-07T19:50:40.3387108Z HIPified Source Files: 2025-05-07T19:50:40.3387114Z 2025-05-07T19:50:40.3387182Z 2025-05-07T19:50:40.3387276Z Library Dependencies: 2025-05-07T19:50:40.3387347Z torch 2025-05-07T19:50:40.3387426Z torch_library 2025-05-07T19:50:40.3387728Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3387974Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3388345Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3388690Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3388952Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3389028Z fbgemm 2025-05-07T19:50:40.3389112Z fbgemm_gpu_config 2025-05-07T19:50:40.3389201Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:40.3389285Z 
fbgemm_gpu_tbe_common 2025-05-07T19:50:40.3389365Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:40.3389471Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:40.3389678Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3389746Z 2025-05-07T19:50:40.3389824Z Output Library: 2025-05-07T19:50:40.3389927Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:40.3389996Z 2025-05-07T19:50:40.3390081Z Destination Directory: 2025-05-07T19:50:40.3390163Z fbgemm_gpu 2025-05-07T19:50:40.3390277Z ================================================================================ 2025-05-07T19:50:40.3390282Z 2025-05-07T19:50:40.3390286Z 2025-05-07T19:50:40.3390290Z 2025-05-07T19:50:40.3390395Z ================================================================================ 2025-05-07T19:50:40.3390601Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:50:40.3390671Z 2025-05-07T19:50:40.3390748Z CPU_SRCS: 2025-05-07T19:50:40.3390752Z 2025-05-07T19:50:40.3390820Z 2025-05-07T19:50:40.3390901Z GPU_SRCS: 2025-05-07T19:50:40.3391090Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:40.3391301Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:40.3391521Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:40.3391715Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:40.3391933Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:40.3392161Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:40.3392361Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:40.3392582Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:40.3392975Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:40.3393176Z 
gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:40.3393390Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:40.3393606Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:40.3393677Z 2025-05-07T19:50:40.3393754Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3393758Z 2025-05-07T19:50:40.3393822Z 2025-05-07T19:50:40.3393903Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3393907Z 2025-05-07T19:50:40.3393972Z 2025-05-07T19:50:40.3394045Z OTHER_SRCS: 2025-05-07T19:50:40.3394049Z 2025-05-07T19:50:40.3394116Z 2025-05-07T19:50:40.3394193Z CC_FLAGS: 2025-05-07T19:50:40.3394197Z 2025-05-07T19:50:40.3394261Z 2025-05-07T19:50:40.3394330Z NVCC_FLAGS: 2025-05-07T19:50:40.3394423Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3394514Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3394604Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3394696Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3394762Z 2025-05-07T19:50:40.3394835Z HIPCC_FLAGS: 2025-05-07T19:50:40.3394839Z 2025-05-07T19:50:40.3394903Z 2025-05-07T19:50:40.3394985Z INCLUDE_DIRS: 2025-05-07T19:50:40.3395081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3395165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3395264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3395354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3395607Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3396008Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3396143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3396287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3396430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3396619Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3396796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3396924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3397201Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3397265Z 2025-05-07T19:50:40.3397344Z Selected Source Files: 2025-05-07T19:50:40.3397521Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:50:40.3397725Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:50:40.3397925Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:50:40.3398103Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:50:40.3398308Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:50:40.3398512Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:50:40.3398696Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:50:40.3398909Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:40.3399118Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:40.3399313Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:50:40.3399526Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:50:40.3399756Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:50:40.3399825Z 2025-05-07T19:50:40.3399906Z HIPified Source Files: 2025-05-07T19:50:40.3399910Z 2025-05-07T19:50:40.3399978Z 2025-05-07T19:50:40.3400056Z Library Dependencies: 2025-05-07T19:50:40.3400122Z torch 2025-05-07T19:50:40.3400252Z torch_library 2025-05-07T19:50:40.3400522Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3400748Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3401040Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3401357Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3401600Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3401694Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:40.3401890Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3401960Z 2025-05-07T19:50:40.3402037Z Output Library: 2025-05-07T19:50:40.3402136Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:50:40.3402202Z 2025-05-07T19:50:40.3402282Z Destination Directory: 2025-05-07T19:50:40.3402355Z fbgemm_gpu 2025-05-07T19:50:40.3402461Z ================================================================================ 2025-05-07T19:50:40.3402465Z 2025-05-07T19:50:40.3402469Z 2025-05-07T19:50:40.3402472Z 2025-05-07T19:50:40.3402568Z ================================================================================ 2025-05-07T19:50:40.3402749Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:50:40.3402824Z 2025-05-07T19:50:40.3402891Z CPU_SRCS: 2025-05-07T19:50:40.3402895Z 2025-05-07T19:50:40.3402956Z 2025-05-07T19:50:40.3403033Z GPU_SRCS: 2025-05-07T19:50:40.3403213Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3403381Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3403636Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3403817Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3404038Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3404267Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3404413Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3404552Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3404692Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3404846Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3404983Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3405126Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3405295Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3405504Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3405703Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3405867Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3406062Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3406251Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3406430Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3406635Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3406842Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3407017Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3407219Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3407421Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3407638Z 
gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3407873Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3408762Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3408989Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3409232Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3409486Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3409622Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3409775Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3409943Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3410087Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3410256Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3410433Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3410572Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3410731Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3410895Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3411047Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3411214Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3411383Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3411528Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3411682Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3411891Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 
2025-05-07T19:50:40.3412045Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3412205Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3412375Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3412442Z 2025-05-07T19:50:40.3412525Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3412529Z 2025-05-07T19:50:40.3412591Z 2025-05-07T19:50:40.3412667Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3412672Z 2025-05-07T19:50:40.3412740Z 2025-05-07T19:50:40.3412811Z OTHER_SRCS: 2025-05-07T19:50:40.3412815Z 2025-05-07T19:50:40.3412877Z 2025-05-07T19:50:40.3412951Z CC_FLAGS: 2025-05-07T19:50:40.3412955Z 2025-05-07T19:50:40.3413018Z 2025-05-07T19:50:40.3413092Z NVCC_FLAGS: 2025-05-07T19:50:40.3413177Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3413273Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3413363Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3413449Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3413518Z 2025-05-07T19:50:40.3413590Z HIPCC_FLAGS: 2025-05-07T19:50:40.3413594Z 2025-05-07T19:50:40.3413659Z 2025-05-07T19:50:40.3413731Z INCLUDE_DIRS: 2025-05-07T19:50:40.3413833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3413922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3414014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3414116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3414372Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3414726Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3414859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3415000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3415141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3415328Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3415512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3415639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3415956Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3416028Z 2025-05-07T19:50:40.3416109Z Selected Source Files: 2025-05-07T19:50:40.3416292Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3416461Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3416653Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3416827Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3417050Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3417282Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3417422Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3417565Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3417712Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3417865Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3418001Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:50:40.3418154Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:50:40.3418326Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3418522Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3418720Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3418894Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3419080Z 
gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3419317Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3419508Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3419709Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3419914Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3420099Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3420296Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3420581Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3420989Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3421243Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3421503Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3421751Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3422021Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3422291Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3422440Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3422616Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3422786Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3422936Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3423118Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3423297Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 
2025-05-07T19:50:40.3423442Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3423622Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3423806Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3423960Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3424143Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3424391Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3424534Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:50:40.3424701Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3424879Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3425029Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:50:40.3425207Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:50:40.3425384Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:50:40.3425465Z 2025-05-07T19:50:40.3425558Z HIPified Source Files: 2025-05-07T19:50:40.3425566Z 2025-05-07T19:50:40.3425648Z 2025-05-07T19:50:40.3425735Z Library Dependencies: 2025-05-07T19:50:40.3425807Z torch 2025-05-07T19:50:40.3425891Z torch_library 2025-05-07T19:50:40.3426184Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3426432Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3426752Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3427100Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3427363Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3427463Z fbgemm_gpu_tbe_training_backward 
2025-05-07T19:50:40.3427677Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3427751Z 2025-05-07T19:50:40.3427830Z Output Library: 2025-05-07T19:50:40.3427991Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:50:40.3428068Z 2025-05-07T19:50:40.3428153Z Destination Directory: 2025-05-07T19:50:40.3428228Z fbgemm_gpu 2025-05-07T19:50:40.3428345Z ================================================================================ 2025-05-07T19:50:40.3428353Z 2025-05-07T19:50:40.3428357Z 2025-05-07T19:50:40.3428361Z 2025-05-07T19:50:40.3428469Z ================================================================================ 2025-05-07T19:50:40.3428674Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:50:40.3428754Z 2025-05-07T19:50:40.3428831Z CPU_SRCS: 2025-05-07T19:50:40.3428835Z 2025-05-07T19:50:40.3428904Z 2025-05-07T19:50:40.3428983Z GPU_SRCS: 2025-05-07T19:50:40.3429124Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:40.3429268Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:40.3429426Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3429602Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3429766Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3429938Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:40.3430141Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3430336Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3430479Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:40.3430631Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:40.3430798Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3430968Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 
2025-05-07T19:50:40.3431076Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:40.3431157Z 2025-05-07T19:50:40.3431243Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3431247Z 2025-05-07T19:50:40.3431323Z 2025-05-07T19:50:40.3431420Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3431429Z 2025-05-07T19:50:40.3431500Z 2025-05-07T19:50:40.3431580Z OTHER_SRCS: 2025-05-07T19:50:40.3431584Z 2025-05-07T19:50:40.3431663Z 2025-05-07T19:50:40.3431739Z CC_FLAGS: 2025-05-07T19:50:40.3431743Z 2025-05-07T19:50:40.3431867Z 2025-05-07T19:50:40.3431943Z NVCC_FLAGS: 2025-05-07T19:50:40.3432043Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3432137Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3432238Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3432339Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3432407Z 2025-05-07T19:50:40.3432487Z HIPCC_FLAGS: 2025-05-07T19:50:40.3432492Z 2025-05-07T19:50:40.3432558Z 2025-05-07T19:50:40.3432644Z INCLUDE_DIRS: 2025-05-07T19:50:40.3432744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3432832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3432935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3433151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3433405Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3433757Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3433890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3434032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3434172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3434359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3434538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3434665Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3434941Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3435007Z 2025-05-07T19:50:40.3435084Z Selected Source Files: 2025-05-07T19:50:40.3435216Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:50:40.3435427Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:50:40.3435562Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:50:40.3435661Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:50:40.3435794Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:50:40.3435938Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:50:40.3436083Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:50:40.3436241Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:50:40.3436414Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:50:40.3436590Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:50:40.3436725Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:50:40.3436889Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:50:40.3437050Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:50:40.3437118Z 2025-05-07T19:50:40.3437204Z HIPified Source Files: 2025-05-07T19:50:40.3437209Z 2025-05-07T19:50:40.3437274Z 2025-05-07T19:50:40.3437355Z Library Dependencies: 2025-05-07T19:50:40.3437422Z torch 2025-05-07T19:50:40.3437500Z torch_library 2025-05-07T19:50:40.3437776Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3438000Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3438303Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
2025-05-07T19:50:40.3438617Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3438859Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3438954Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:50:40.3439146Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3439212Z 2025-05-07T19:50:40.3439284Z Output Library: 2025-05-07T19:50:40.3439388Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:50:40.3439453Z 2025-05-07T19:50:40.3439533Z Destination Directory: 2025-05-07T19:50:40.3439670Z fbgemm_gpu 2025-05-07T19:50:40.3439772Z ================================================================================ 2025-05-07T19:50:40.3439776Z 2025-05-07T19:50:40.3439779Z 2025-05-07T19:50:40.3439783Z 2025-05-07T19:50:40.3439882Z ================================================================================ 2025-05-07T19:50:40.3440092Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:50:40.3440158Z 2025-05-07T19:50:40.3440226Z CPU_SRCS: 2025-05-07T19:50:40.3440230Z 2025-05-07T19:50:40.3440304Z 2025-05-07T19:50:40.3440374Z GPU_SRCS: 2025-05-07T19:50:40.3440475Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:40.3440591Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:40.3440699Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:40.3440795Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:40.3440890Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:40.3440997Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:40.3441134Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:40.3441267Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:40.3441360Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:40.3441525Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 
2025-05-07T19:50:40.3441632Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:40.3441768Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:40.3441957Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:40.3442160Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:40.3442391Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:40.3442543Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:40.3442657Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:50:40.3442795Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:40.3442939Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:40.3443108Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:40.3443278Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:40.3443397Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:40.3443533Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:40.3443655Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:40.3443787Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:40.3443915Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:40.3444047Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:40.3444185Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:40.3444328Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:40.3444513Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:40.3444704Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:40.3444884Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:40.3445078Z 
gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:40.3445201Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:40.3445333Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:40.3445547Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:40.3445763Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:40.3445828Z 2025-05-07T19:50:40.3445902Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3445910Z 2025-05-07T19:50:40.3445979Z 2025-05-07T19:50:40.3446057Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3446061Z 2025-05-07T19:50:40.3446123Z 2025-05-07T19:50:40.3446199Z OTHER_SRCS: 2025-05-07T19:50:40.3446252Z 2025-05-07T19:50:40.3446316Z 2025-05-07T19:50:40.3446388Z CC_FLAGS: 2025-05-07T19:50:40.3446391Z 2025-05-07T19:50:40.3446462Z 2025-05-07T19:50:40.3446536Z NVCC_FLAGS: 2025-05-07T19:50:40.3446620Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3446705Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3446798Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3446880Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3446947Z 2025-05-07T19:50:40.3447019Z HIPCC_FLAGS: 2025-05-07T19:50:40.3447029Z 2025-05-07T19:50:40.3447092Z 2025-05-07T19:50:40.3447163Z INCLUDE_DIRS: 2025-05-07T19:50:40.3447257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3447347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3447437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3447532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3447788Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3448139Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3448266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3448407Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3448551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3448731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3448908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3449042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3449313Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3449378Z 2025-05-07T19:50:40.3449510Z Selected Source Files: 2025-05-07T19:50:40.3449610Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:50:40.3449727Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:50:40.3449821Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:50:40.3449923Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:50:40.3450014Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:50:40.3450115Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:50:40.3450257Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:50:40.3450398Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:50:40.3450497Z gen_embedding_backward_split_none.cpp 2025-05-07T19:50:40.3450663Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:40.3450782Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:50:40.3450924Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:50:40.3451112Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:50:40.3451327Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:40.3451506Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:50:40.3451656Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:50:40.3451786Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 
2025-05-07T19:50:40.3451925Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:40.3452073Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:40.3452238Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:50:40.3452419Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:50:40.3452543Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:50:40.3452676Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:40.3452814Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:50:40.3452953Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:50:40.3453080Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:50:40.3453227Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:40.3453365Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:50:40.3453564Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:50:40.3453751Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:50:40.3453955Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:50:40.3454137Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:50:40.3454329Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:50:40.3454470Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:50:40.3454605Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:50:40.3454817Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:50:40.3455052Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:50:40.3455122Z 2025-05-07T19:50:40.3455208Z HIPified Source Files: 2025-05-07T19:50:40.3455212Z 2025-05-07T19:50:40.3455286Z 2025-05-07T19:50:40.3455388Z Library Dependencies: 
2025-05-07T19:50:40.3455459Z torch 2025-05-07T19:50:40.3455537Z torch_library 2025-05-07T19:50:40.3455830Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3456064Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3456359Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3456689Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3456940Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3457021Z fbgemm_gpu_config 2025-05-07T19:50:40.3457152Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:40.3457362Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3457434Z 2025-05-07T19:50:40.3457515Z Output Library: 2025-05-07T19:50:40.3457642Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:50:40.3457716Z 2025-05-07T19:50:40.3457802Z Destination Directory: 2025-05-07T19:50:40.3457877Z fbgemm_gpu 2025-05-07T19:50:40.3457994Z ================================================================================ 2025-05-07T19:50:40.3457998Z 2025-05-07T19:50:40.3458002Z 2025-05-07T19:50:40.3458006Z 2025-05-07T19:50:40.3458104Z ================================================================================ 2025-05-07T19:50:40.3458263Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:50:40.3458347Z 2025-05-07T19:50:40.3458422Z CPU_SRCS: 2025-05-07T19:50:40.3458613Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:40.3458799Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:40.3458871Z 2025-05-07T19:50:40.3458942Z GPU_SRCS: 2025-05-07T19:50:40.3459115Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:40.3459251Z 
gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:40.3459364Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:40.3459488Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:40.3459629Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:40.3459753Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:40.3459872Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:40.3460001Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:40.3460073Z 2025-05-07T19:50:40.3460153Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3460157Z 2025-05-07T19:50:40.3460221Z 2025-05-07T19:50:40.3460314Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3460381Z 2025-05-07T19:50:40.3460469Z 2025-05-07T19:50:40.3460546Z OTHER_SRCS: 2025-05-07T19:50:40.3460550Z 2025-05-07T19:50:40.3460633Z 2025-05-07T19:50:40.3460709Z CC_FLAGS: 2025-05-07T19:50:40.3460713Z 2025-05-07T19:50:40.3460957Z 2025-05-07T19:50:40.3461037Z NVCC_FLAGS: 2025-05-07T19:50:40.3461210Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3461307Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3461409Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3461520Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3461592Z 2025-05-07T19:50:40.3461755Z HIPCC_FLAGS: 2025-05-07T19:50:40.3461759Z 2025-05-07T19:50:40.3461844Z 2025-05-07T19:50:40.3461929Z INCLUDE_DIRS: 2025-05-07T19:50:40.3462042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3462135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3462245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3462350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3462627Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3463029Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3463173Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3463333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3463493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3463706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3463904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3464048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3464355Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3464430Z 2025-05-07T19:50:40.3464520Z Selected Source Files: 2025-05-07T19:50:40.3464735Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:50:40.3464987Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:50:40.3465177Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:50:40.3465313Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:50:40.3465443Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:50:40.3465579Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:50:40.3465721Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:50:40.3465860Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:50:40.3465993Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:50:40.3466122Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:50:40.3466207Z 2025-05-07T19:50:40.3466299Z HIPified Source Files: 2025-05-07T19:50:40.3466303Z 2025-05-07T19:50:40.3466378Z 2025-05-07T19:50:40.3466469Z Library Dependencies: 2025-05-07T19:50:40.3466553Z torch 2025-05-07T19:50:40.3466637Z torch_library 2025-05-07T19:50:40.3466941Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3467206Z 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3467529Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3467876Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3468153Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3468254Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:40.3468343Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:40.3468553Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3468639Z 2025-05-07T19:50:40.3468721Z Output Library: 2025-05-07T19:50:40.3468817Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:40.3468904Z 2025-05-07T19:50:40.3468992Z Destination Directory: 2025-05-07T19:50:40.3469075Z fbgemm_gpu 2025-05-07T19:50:40.3469190Z ================================================================================ 2025-05-07T19:50:40.3469195Z 2025-05-07T19:50:40.3469317Z 2025-05-07T19:50:40.3469322Z 2025-05-07T19:50:40.3469430Z ================================================================================ 2025-05-07T19:50:40.3469677Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:50:40.3469754Z 2025-05-07T19:50:40.3469846Z CPU_SRCS: 2025-05-07T19:50:40.3470020Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:40.3470091Z 2025-05-07T19:50:40.3470183Z GPU_SRCS: 2025-05-07T19:50:40.3470353Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:40.3470506Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:40.3470582Z 2025-05-07T19:50:40.3470682Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3470686Z 2025-05-07T19:50:40.3470759Z 2025-05-07T19:50:40.3470848Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3470852Z 2025-05-07T19:50:40.3470940Z 2025-05-07T19:50:40.3471026Z OTHER_SRCS: 
2025-05-07T19:50:40.3471030Z 2025-05-07T19:50:40.3471103Z 2025-05-07T19:50:40.3471194Z CC_FLAGS: 2025-05-07T19:50:40.3471198Z 2025-05-07T19:50:40.3471272Z 2025-05-07T19:50:40.3471354Z NVCC_FLAGS: 2025-05-07T19:50:40.3471454Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3471563Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3471669Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3471768Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3471854Z 2025-05-07T19:50:40.3471933Z HIPCC_FLAGS: 2025-05-07T19:50:40.3471937Z 2025-05-07T19:50:40.3472017Z 2025-05-07T19:50:40.3472100Z INCLUDE_DIRS: 2025-05-07T19:50:40.3472220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3472313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3472412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3472527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3472802Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3473346Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3473481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3473636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3473784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3473969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3474163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3474294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3474567Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3474644Z 2025-05-07T19:50:40.3474727Z Selected Source Files: 2025-05-07T19:50:40.3474885Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:50:40.3475042Z 
src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:50:40.3475196Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:50:40.3475270Z 2025-05-07T19:50:40.3475351Z HIPified Source Files: 2025-05-07T19:50:40.3475355Z 2025-05-07T19:50:40.3475433Z 2025-05-07T19:50:40.3475521Z Library Dependencies: 2025-05-07T19:50:40.3475590Z torch 2025-05-07T19:50:40.3475667Z torch_library 2025-05-07T19:50:40.3476106Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3476527Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3476846Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3477192Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3477616Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3477823Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3477910Z 2025-05-07T19:50:40.3477989Z Output Library: 2025-05-07T19:50:40.3478090Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:40.3478159Z 2025-05-07T19:50:40.3478255Z Destination Directory: 2025-05-07T19:50:40.3478428Z fbgemm_gpu 2025-05-07T19:50:40.3478534Z ================================================================================ 2025-05-07T19:50:40.3478539Z 2025-05-07T19:50:40.3478543Z 2025-05-07T19:50:40.3478547Z 2025-05-07T19:50:40.3478659Z ================================================================================ 2025-05-07T19:50:40.3478781Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:50:40.3478851Z 2025-05-07T19:50:40.3478934Z CPU_SRCS: 2025-05-07T19:50:40.3479035Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:40.3479136Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:40.3479329Z 
src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:40.3479555Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:40.3479758Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:40.3479970Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:40.3480197Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:40.3480428Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:40.3480580Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:40.3480714Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:40.3480840Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:40.3480957Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:50:40.3481101Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:40.3481208Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:40.3481315Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:40.3481507Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:40.3481618Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:40.3481719Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:40.3481812Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:40.3481909Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:40.3482018Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:40.3482114Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:40.3482214Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:40.3482324Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:40.3482559Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:40.3482709Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:40.3482933Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:40.3483162Z 
src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:40.3483267Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:40.3483365Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:40.3483478Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:40.3483591Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:40.3483792Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:40.3483894Z src/topology_utils.cpp 2025-05-07T19:50:40.3483965Z 2025-05-07T19:50:40.3484043Z GPU_SRCS: 2025-05-07T19:50:40.3484154Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:40.3484267Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:40.3484479Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:40.3484578Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:40.3484685Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:40.3484870Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:40.3485053Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:40.3485194Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:40.3485329Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:40.3485579Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:40.3485757Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:40.3485995Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:40.3486134Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:40.3486283Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:40.3486424Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:40.3486550Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:40.3486675Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:40.3486797Z 
src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:40.3486956Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:40.3487106Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:40.3487235Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:40.3487394Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:40.3487525Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:40.3487624Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:40.3487852Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:40.3488037Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:40.3488328Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:40.3488440Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:40.3488545Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:50:40.3488664Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:40.3488893Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:40.3488996Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:40.3489082Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:40.3489246Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:40.3489345Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:40.3489460Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:40.3489578Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:40.3489684Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:40.3489810Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:40.3489938Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:40.3490062Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:40.3490164Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:40.3490255Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:40.3490348Z 
src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:40.3490447Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:40.3490572Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:40.3490685Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:40.3490781Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:40.3490877Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:40.3490966Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:40.3491071Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:40.3491168Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:40.3491270Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:40.3491366Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:40.3491451Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:40.3491521Z 2025-05-07T19:50:40.3491597Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:50:40.3491602Z 2025-05-07T19:50:40.3491668Z 2025-05-07T19:50:40.3491746Z HIP_SPECIFIC_SRCS: 2025-05-07T19:50:40.3491750Z 2025-05-07T19:50:40.3491814Z 2025-05-07T19:50:40.3491883Z OTHER_SRCS: 2025-05-07T19:50:40.3491887Z 2025-05-07T19:50:40.3491951Z 2025-05-07T19:50:40.3492024Z CC_FLAGS: 2025-05-07T19:50:40.3492027Z 2025-05-07T19:50:40.3492089Z 2025-05-07T19:50:40.3492162Z NVCC_FLAGS: 2025-05-07T19:50:40.3492251Z --expt-relaxed-constexpr 2025-05-07T19:50:40.3492339Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:50:40.3492430Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:50:40.3492512Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:50:40.3492581Z 2025-05-07T19:50:40.3492702Z HIPCC_FLAGS: 2025-05-07T19:50:40.3492705Z 2025-05-07T19:50:40.3492771Z 2025-05-07T19:50:40.3492845Z INCLUDE_DIRS: 2025-05-07T19:50:40.3492938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3493020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:50:40.3493112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:50:40.3493207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:50:40.3493460Z 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 2025-05-07T19:50:40.3493813Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:50:40.3493949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:50:40.3494096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:50:40.3494238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:50:40.3494433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:50:40.3494618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:50:40.3494752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:50:40.3495032Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 2025-05-07T19:50:40.3495098Z 2025-05-07T19:50:40.3495178Z Selected Source Files: 2025-05-07T19:50:40.3495268Z src/memory_utils/memory_utils.cpp 2025-05-07T19:50:40.3495365Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:50:40.3495540Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:40.3495729Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:50:40.3495918Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:50:40.3496242Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:50:40.3496433Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:50:40.3496645Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:50:40.3496790Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:50:40.3496912Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:50:40.3497033Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:50:40.3497569Z src/input_combine_ops/input_combine_cpu.cpp 
2025-05-07T19:50:40.3497921Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:50:40.3498269Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:50:40.3498585Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:50:40.3498899Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:50:40.3499215Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:50:40.3499480Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:50:40.3499755Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:50:40.3500008Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:50:40.3500278Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:50:40.3500639Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:50:40.3501104Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:50:40.3501406Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:50:40.3501829Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:50:40.3502340Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:50:40.3503082Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:40.3503837Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:50:40.3504286Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:50:40.3504593Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:50:40.3504870Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:50:40.3505173Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:50:40.3505572Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:50:40.3505957Z src/topology_utils.cpp 2025-05-07T19:50:40.3506209Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:50:40.3506599Z src/input_combine_ops/input_combine.cu 2025-05-07T19:50:40.3507014Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:50:40.3507421Z src/memory_utils/memory_utils.cu 2025-05-07T19:50:40.3507700Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:50:40.3508073Z 
src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:50:40.3508558Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:50:40.3508969Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:50:40.3509336Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:50:40.3509824Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:50:40.3510384Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:50:40.3510847Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:50:40.3511274Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:50:40.3511672Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:50:40.3512053Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:50:40.3512432Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:50:40.3512785Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:50:40.3513239Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:50:40.3513600Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:50:40.3513996Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:50:40.3514370Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:50:40.3514729Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:50:40.3515105Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:50:40.3515780Z src/metric_ops/metric_ops.cu 2025-05-07T19:50:40.3516168Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:50:40.3516667Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:50:40.3517140Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:50:40.3517533Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:50:40.3517828Z src/quantize_ops/quantize_fp8_rowwise.cu 
2025-05-07T19:50:40.3518264Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:50:40.3518584Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:50:40.3518895Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:50:40.3519154Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:50:40.3519447Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:50:40.3519749Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:50:40.3520019Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:50:40.3520366Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:50:40.3520686Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:50:40.3521011Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:50:40.3521359Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:50:40.3521722Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:50:40.3522051Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:50:40.3522334Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:50:40.3522603Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:50:40.3522893Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:50:40.3523203Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:50:40.3523529Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:50:40.3523837Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:50:40.3524108Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:50:40.3524380Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:50:40.3524657Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:50:40.3524956Z src/sparse_ops/sparse_range.cu 2025-05-07T19:50:40.3525223Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:50:40.3525525Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:50:40.3525807Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:50:40.3526103Z 2025-05-07T19:50:40.3526284Z HIPified Source Files: 2025-05-07T19:50:40.3526422Z 2025-05-07T19:50:40.3526497Z 
2025-05-07T19:50:40.3526673Z Library Dependencies: 2025-05-07T19:50:40.3526870Z torch 2025-05-07T19:50:40.3527052Z torch_library 2025-05-07T19:50:40.3527437Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 2025-05-07T19:50:40.3528068Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 2025-05-07T19:50:40.3528699Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:50:40.3529420Z /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:50:40.3530106Z /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so 2025-05-07T19:50:40.3530521Z fbgemm 2025-05-07T19:50:40.3530711Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:50:40.3530963Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:50:40.3531229Z fbgemm_gpu_tbe_index_select 2025-05-07T19:50:40.3531454Z fbgemm_gpu_tbe_cache 2025-05-07T19:50:40.3531683Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:50:40.3531898Z fbgemm_gpu_tbe_utils 2025-05-07T19:50:40.3532226Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:50:40.3532589Z 2025-05-07T19:50:40.3532749Z Output Library: 2025-05-07T19:50:40.3532947Z fbgemm_gpu_py 2025-05-07T19:50:40.3533122Z 2025-05-07T19:50:40.3533296Z Destination Directory: 2025-05-07T19:50:40.3533496Z fbgemm_gpu 2025-05-07T19:50:40.3533703Z ================================================================================ 2025-05-07T19:50:40.3533910Z 2025-05-07T19:50:40.3534000Z -- Configuring done (9.2s) 2025-05-07T19:50:40.4852402Z -- Generating done (0.1s) 2025-05-07T19:50:40.4870333Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build 2025-05-07T19:50:40.5135610Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build' 2025-05-07T19:50:40.5136748Z 2025-05-07T19:50:40.5137604Z Run Build Command(s): 
/github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:50:40.6323347Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:50:40.6334539Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6541933Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:50:40.6553862Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6588654Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:50:40.6600124Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6654170Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:50:40.6665743Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6769834Z [5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:50:40.6782107Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6792866Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:50:40.6804142Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6861951Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:50:40.6873716Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6885368Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:50:40.6896634Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.6907919Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:50:40.6918604Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7030303Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:50:40.7041463Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7196462Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:50:40.7207669Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7354072Z 
[12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:50:40.7365841Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7499349Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:50:40.7510886Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7595389Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:50:40.7605843Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7617087Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:50:40.7628312Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7643643Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:50:40.7654317Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7709200Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
-MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:50:40.7720290Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7825866Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:50:40.7837019Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.7855696Z [19/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:50:40.7870405Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8168802Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:50:40.8181161Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8192730Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:50:40.8204525Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8540138Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:50:40.8551742Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8562933Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:50:40.8574475Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8602182Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:50:40.8613089Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8634067Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:50:40.8645420Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8655996Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:50:40.8666919Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8847789Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:50:40.8858641Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.8982248Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:50:40.8994125Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9004272Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:50:40.9015284Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:40.9051596Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:50:40.9074916Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9214594Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:50:40.9226025Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9237261Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:50:40.9248736Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9321624Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:50:40.9332588Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9384115Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:50:40.9394973Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9557703Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:50:40.9567453Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9710849Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:50:40.9722194Z clang-16: warning: argument 
unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:40.9889497Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:50:40.9901407Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.0054531Z [38/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:50:41.0066053Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.0101646Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:50:41.0113406Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.0162632Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:50:41.0174062Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:41.0368261Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:50:41.0379686Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.0570051Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:50:41.0581813Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.0665009Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:50:41.0676570Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.0934619Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:50:41.0946843Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.1549717Z [45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:50:41.1560977Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.1938545Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:50:41.1950172Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.2251719Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:50:41.2263095Z clang-16: warning: argument 
unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.2554351Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:50:41.2566108Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.2765871Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:50:41.2777881Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.2802126Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:50:41.2813203Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.3247984Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:50:41.3259583Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:50:41.4514940Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:50:41.4526112Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.4783283Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:50:41.4795392Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.5788453Z [54/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:50:41.5799990Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.6520534Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:50:41.6532302Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.6960824Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:50:41.6973182Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.7267987Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:50:41.7286751Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.7335549Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:50:41.7351735Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.9241916Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:50:41.9254450Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:41.9839951Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:50:41.9852249Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:42.0957068Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:50:42.0974781Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:42.2924985Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c 
/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:50:42.2937221Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:42.8884375Z [63/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o 
CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:43.1885701Z [64/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:50:43.1903014Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:43.5915062Z [65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:50:43.5932289Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.0227969Z [66/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:50:47.0244032Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:47.5964434Z [67/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:50:47.5981883Z clang-16: warning: argument unused during 
compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.0565927Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:50:49.0582400Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.1092591Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:50:49.1109426Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.1478327Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:50:49.1495639Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.1807781Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:50:49.1824990Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:49.2323879Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:50:49.2341327Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.2462004Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:50:50.2480061Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:50.7694238Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 
2025-05-07T19:50:50.7711354Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:51.7789843Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:50:51.7809486Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.3604431Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:50:52.3623475Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.4732272Z [77/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:50:52.4748889Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:52.9271240Z [78/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:50:53.7778634Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:50:53.7797647Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:56.2463905Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:50:56.2478146Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:56.5430921Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 
2025-05-07T19:50:56.5445647Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:50:57.9193846Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:50:57.9211988Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:00.4470399Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:51:00.4489008Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:00.8899143Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:51:00.8916256Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:01.0752928Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:51:01.0771699Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:03.5560654Z [86/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:51:03.5577386Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:05.0302352Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:51:05.0318683Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:07.0245779Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:51:07.0263078Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:10.0003438Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
-mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:51:10.0022320Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:10.4563336Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:51:10.4582191Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:12.9428665Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:51:12.9443545Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:14.2082328Z [92/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:51:14.2097246Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:15.7452993Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 
2025-05-07T19:51:15.7471658Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:16.9010063Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:51:16.9028480Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:19.1705114Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:51:19.1725733Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:19.6490752Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:51:19.6510298Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:22.1889646Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:51:22.1909955Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:23.3548319Z [98/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:23.3564568Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:24.9132447Z [99/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:51:24.9151524Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:26.3745474Z [100/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:26.3765768Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:28.4677033Z [101/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:51:28.4697656Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:37.2188829Z [102/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:51:37.2207628Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 
2025-05-07T19:51:37.7904760Z [103/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:51:37.7922877Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:51:43.8924973Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:51:43.8948148Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8951009Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8952164Z ^ 2025-05-07T19:51:43.8952426Z 2025-05-07T19:51:43.8952853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8953514Z 2025-05-07T19:51:43.8955465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8958186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8959383Z ^ 2025-05-07T19:51:43.8959742Z 2025-05-07T19:51:43.8961424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8964108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8965311Z ^ 2025-05-07T19:51:43.8965560Z 2025-05-07T19:51:43.8966006Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8966803Z 2025-05-07T19:51:43.8968491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8970816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8971733Z ^ 2025-05-07T19:51:43.8972031Z 2025-05-07T19:51:43.8973402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8975546Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8976998Z ^ 2025-05-07T19:51:43.8977196Z 2025-05-07T19:51:43.8977566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8978154Z 2025-05-07T19:51:43.8979736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8982416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8983555Z ^ 2025-05-07T19:51:43.8983919Z 2025-05-07T19:51:43.8985512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8988066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8989145Z ^ 2025-05-07T19:51:43.8989392Z 2025-05-07T19:51:43.8989808Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.8990479Z 2025-05-07T19:51:43.8992006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.8994826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.8995941Z ^ 2025-05-07T19:51:43.8996292Z 2025-05-07T19:51:43.8997817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.9000254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.9001390Z ^ 2025-05-07T19:51:43.9001642Z 2025-05-07T19:51:43.9002088Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:43.9002769Z 2025-05-07T19:51:43.9004444Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:43.9006951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:43.9008066Z ^ 2025-05-07T19:51:43.9008407Z 2025-05-07T19:51:44.6281471Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:51:44.6305401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6307997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6309335Z ^ 2025-05-07T19:51:44.6309570Z 2025-05-07T19:51:44.6310001Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6310621Z 2025-05-07T19:51:44.6312168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6314886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:51:44.6315989Z ^ 2025-05-07T19:51:44.6316316Z 2025-05-07T19:51:44.6317708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6319908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6320922Z ^ 2025-05-07T19:51:44.6321156Z 2025-05-07T19:51:44.6321576Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6322218Z 2025-05-07T19:51:44.6323816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6326420Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6327579Z ^ 2025-05-07T19:51:44.6327957Z 2025-05-07T19:51:44.6329618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6332601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6333966Z ^ 2025-05-07T19:51:44.6334194Z 2025-05-07T19:51:44.6334623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6335204Z 2025-05-07T19:51:44.6336640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:51:44.6339060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6340272Z ^ 2025-05-07T19:51:44.6340821Z 2025-05-07T19:51:44.6342515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6345282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6346400Z ^ 2025-05-07T19:51:44.6346669Z 2025-05-07T19:51:44.6347119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6347791Z 2025-05-07T19:51:44.6349762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6352553Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6353625Z ^ 2025-05-07T19:51:44.6353950Z 2025-05-07T19:51:44.6355516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6358177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6359561Z ^ 2025-05-07T19:51:44.6359815Z 2025-05-07T19:51:44.6360268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6360954Z 2025-05-07T19:51:44.6362629Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6365366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6366556Z ^ 2025-05-07T19:51:44.6366921Z 2025-05-07T19:51:44.6898416Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:51:44.6922140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6924597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6925655Z ^ 2025-05-07T19:51:44.6925910Z 2025-05-07T19:51:44.6926310Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6926918Z 2025-05-07T19:51:44.6928356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6930829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6931755Z ^ 2025-05-07T19:51:44.6932082Z 2025-05-07T19:51:44.6933706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6936261Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6937386Z ^ 2025-05-07T19:51:44.6937627Z 2025-05-07T19:51:44.6938061Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6938710Z 2025-05-07T19:51:44.6940283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6942833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6944276Z ^ 2025-05-07T19:51:44.6944659Z 2025-05-07T19:51:44.6946270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6949021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6950181Z ^ 2025-05-07T19:51:44.6950445Z 2025-05-07T19:51:44.6950890Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6951560Z 2025-05-07T19:51:44.6953382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6955939Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6957105Z ^ 2025-05-07T19:51:44.6957453Z 2025-05-07T19:51:44.6958983Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6961692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6962907Z ^ 2025-05-07T19:51:44.6963170Z 2025-05-07T19:51:44.6963886Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6964592Z 2025-05-07T19:51:44.6966320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6969107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6970323Z ^ 2025-05-07T19:51:44.6970691Z 2025-05-07T19:51:44.6972405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6975001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6976443Z ^ 2025-05-07T19:51:44.6976697Z 2025-05-07T19:51:44.6977161Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:44.6977841Z 2025-05-07T19:51:44.6979417Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:44.6982152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:44.6983436Z ^ 2025-05-07T19:51:44.6983822Z 2025-05-07T19:51:45.0748760Z [107/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:51:45.0769712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0772311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0773430Z ^ 2025-05-07T19:51:45.0773900Z 2025-05-07T19:51:45.0774329Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.0774957Z 2025-05-07T19:51:45.0776839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0779630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0780805Z ^ 
2025-05-07T19:51:45.0781158Z 2025-05-07T19:51:45.0782675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0784706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0785528Z ^ 2025-05-07T19:51:45.0785714Z 2025-05-07T19:51:45.0786053Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.0786518Z 2025-05-07T19:51:45.0787693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0790293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0791388Z ^ 2025-05-07T19:51:45.0791735Z 2025-05-07T19:51:45.0793252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0795704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0796767Z ^ 2025-05-07T19:51:45.0796997Z 2025-05-07T19:51:45.0797421Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.0798032Z 2025-05-07T19:51:45.0799590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:51:45.0802020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0803096Z ^ 2025-05-07T19:51:45.0803594Z 2025-05-07T19:51:45.0805082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0807806Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0808927Z ^ 2025-05-07T19:51:45.0809178Z 2025-05-07T19:51:45.0809641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.0810286Z 2025-05-07T19:51:45.0811873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0814506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0815655Z ^ 2025-05-07T19:51:45.0815995Z 2025-05-07T19:51:45.0817555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0820072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0821329Z ^ 2025-05-07T19:51:45.0821617Z 2025-05-07T19:51:45.0822156Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.0822762Z 2025-05-07T19:51:45.0824466Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.0826828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.0827909Z ^ 2025-05-07T19:51:45.0828245Z 2025-05-07T19:51:45.4166761Z [108/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:51:45.4189068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4191674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4192739Z ^ 2025-05-07T19:51:45.4193008Z 2025-05-07T19:51:45.4193432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4194060Z 2025-05-07T19:51:45.4195698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4198069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:51:45.4199042Z ^ 2025-05-07T19:51:45.4199343Z 2025-05-07T19:51:45.4200907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4203493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4204559Z ^ 2025-05-07T19:51:45.4204747Z 2025-05-07T19:51:45.4205173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4205832Z 2025-05-07T19:51:45.4207247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4209956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4211147Z ^ 2025-05-07T19:51:45.4211512Z 2025-05-07T19:51:45.4213027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4215813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4216962Z ^ 2025-05-07T19:51:45.4217237Z 2025-05-07T19:51:45.4217676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4218305Z 2025-05-07T19:51:45.4219948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:51:45.4222702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4223907Z ^ 2025-05-07T19:51:45.4224264Z 2025-05-07T19:51:45.4226244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4228755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4229885Z ^ 2025-05-07T19:51:45.4230145Z 2025-05-07T19:51:45.4230581Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4231260Z 2025-05-07T19:51:45.4232828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4235453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4236623Z ^ 2025-05-07T19:51:45.4237008Z 2025-05-07T19:51:45.4238514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4241190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4242464Z ^ 2025-05-07T19:51:45.4242708Z 2025-05-07T19:51:45.4243195Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4243856Z 2025-05-07T19:51:45.4245427Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4247903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4249171Z ^ 2025-05-07T19:51:45.4249663Z 2025-05-07T19:51:45.4357096Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:51:45.4377656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4380309Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4381633Z ^ 2025-05-07T19:51:45.4381906Z 2025-05-07T19:51:45.4382340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4383019Z 2025-05-07T19:51:45.4384716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4387575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:51:45.4388754Z ^ 2025-05-07T19:51:45.4389156Z 2025-05-07T19:51:45.4390789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4393503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4394680Z ^ 2025-05-07T19:51:45.4394972Z 2025-05-07T19:51:45.4395412Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4396413Z 2025-05-07T19:51:45.4398033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4400739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4401865Z ^ 2025-05-07T19:51:45.4402218Z 2025-05-07T19:51:45.4403824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4406505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4407739Z ^ 2025-05-07T19:51:45.4408005Z 2025-05-07T19:51:45.4408439Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4409125Z 2025-05-07T19:51:45.4410737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:51:45.4413431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4414560Z ^ 2025-05-07T19:51:45.4414913Z 2025-05-07T19:51:45.4416627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4419045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4420246Z ^ 2025-05-07T19:51:45.4420467Z 2025-05-07T19:51:45.4421008Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4421552Z 2025-05-07T19:51:45.4422999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4425633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4426743Z ^ 2025-05-07T19:51:45.4427064Z 2025-05-07T19:51:45.4428579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4431237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4432556Z ^ 2025-05-07T19:51:45.4432819Z 2025-05-07T19:51:45.4433220Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.4433815Z 2025-05-07T19:51:45.4435276Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.4437571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.4438573Z ^ 2025-05-07T19:51:45.4439084Z 2025-05-07T19:51:45.8063539Z [110/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:51:45.8087587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8090386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8091601Z ^ 2025-05-07T19:51:45.8091898Z 2025-05-07T19:51:45.8092370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8093052Z 2025-05-07T19:51:45.8094803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8097547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:51:45.8098808Z ^ 2025-05-07T19:51:45.8099195Z 2025-05-07T19:51:45.8101005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8103766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8105350Z ^ 2025-05-07T19:51:45.8105623Z 2025-05-07T19:51:45.8106120Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8106808Z 2025-05-07T19:51:45.8108544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8111510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8112817Z ^ 2025-05-07T19:51:45.8113202Z 2025-05-07T19:51:45.8114952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8117813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8119070Z ^ 2025-05-07T19:51:45.8119383Z 2025-05-07T19:51:45.8119863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8120572Z 2025-05-07T19:51:45.8122405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:51:45.8125379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8126623Z ^ 2025-05-07T19:51:45.8127008Z 2025-05-07T19:51:45.8128735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8131492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8132719Z ^ 2025-05-07T19:51:45.8132990Z 2025-05-07T19:51:45.8133465Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8134091Z 2025-05-07T19:51:45.8135388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8137449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8138390Z ^ 2025-05-07T19:51:45.8138707Z 2025-05-07T19:51:45.8139968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8142159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8143084Z ^ 2025-05-07T19:51:45.8143318Z 2025-05-07T19:51:45.8143668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8144197Z 2025-05-07T19:51:45.8145519Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8148020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8149425Z ^ 2025-05-07T19:51:45.8149941Z 2025-05-07T19:51:45.8584102Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:51:45.8607868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8610610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8611851Z ^ 2025-05-07T19:51:45.8612119Z 2025-05-07T19:51:45.8612565Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8613253Z 2025-05-07T19:51:45.8614947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8617667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:51:45.8618750Z ^ 2025-05-07T19:51:45.8619224Z 2025-05-07T19:51:45.8620924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8624155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8625364Z ^ 2025-05-07T19:51:45.8625642Z 2025-05-07T19:51:45.8626100Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8626783Z 2025-05-07T19:51:45.8628536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8631333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8632588Z ^ 2025-05-07T19:51:45.8632968Z 2025-05-07T19:51:45.8634702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8637137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8638226Z ^ 2025-05-07T19:51:45.8638433Z 2025-05-07T19:51:45.8638821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8639397Z 2025-05-07T19:51:45.8641102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:51:45.8643808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8645001Z ^ 2025-05-07T19:51:45.8645380Z 2025-05-07T19:51:45.8647024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8649687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8650845Z ^ 2025-05-07T19:51:45.8651109Z 2025-05-07T19:51:45.8651549Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8652204Z 2025-05-07T19:51:45.8653875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8656525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8657774Z ^ 2025-05-07T19:51:45.8658140Z 2025-05-07T19:51:45.8659738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8662547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8663714Z ^ 2025-05-07T19:51:45.8663973Z 2025-05-07T19:51:45.8664438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:45.8665091Z 2025-05-07T19:51:45.8666750Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:45.8669540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:45.8670662Z ^ 2025-05-07T19:51:45.8671046Z 2025-05-07T19:51:47.0499450Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:51:47.7284261Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:51:47.7306532Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7309280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7310486Z ^ 2025-05-07T19:51:47.7310767Z 2025-05-07T19:51:47.7311222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.7311882Z 2025-05-07T19:51:47.7313962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7316773Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7317998Z ^ 2025-05-07T19:51:47.7318370Z 2025-05-07T19:51:47.7320052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7322869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7324053Z ^ 2025-05-07T19:51:47.7324307Z 2025-05-07T19:51:47.7324765Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.7325444Z 2025-05-07T19:51:47.7327088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7329813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7331022Z ^ 2025-05-07T19:51:47.7331412Z 2025-05-07T19:51:47.7333065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7335945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7337207Z ^ 2025-05-07T19:51:47.7337508Z 2025-05-07T19:51:47.7337990Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:51:47.7338714Z 2025-05-07T19:51:47.7340767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7343715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7344954Z ^ 2025-05-07T19:51:47.7345340Z 2025-05-07T19:51:47.7347033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7349781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7350965Z ^ 2025-05-07T19:51:47.7351216Z 2025-05-07T19:51:47.7351681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.7352401Z 2025-05-07T19:51:47.7354068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7356777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7357989Z ^ 2025-05-07T19:51:47.7358381Z 2025-05-07T19:51:47.7360216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7362916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7364147Z ^ 2025-05-07T19:51:47.7364413Z 2025-05-07T19:51:47.7364887Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:47.7365575Z 2025-05-07T19:51:47.7367276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:47.7370034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:47.7371280Z ^ 2025-05-07T19:51:47.7371671Z 2025-05-07T19:51:56.9970373Z [114/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:51:56.9992376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:56.9994794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:56.9995969Z ^ 2025-05-07T19:51:56.9996204Z 2025-05-07T19:51:56.9997193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:56.9997855Z 2025-05-07T19:51:56.9999474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0001950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0003055Z ^ 2025-05-07T19:51:57.0003440Z 2025-05-07T19:51:57.0005018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0007587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0008585Z ^ 2025-05-07T19:51:57.0008817Z 2025-05-07T19:51:57.0009273Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:57.0009914Z 2025-05-07T19:51:57.0011419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0013994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0015189Z ^ 2025-05-07T19:51:57.0015505Z 2025-05-07T19:51:57.0017027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0019721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0021058Z ^ 2025-05-07T19:51:57.0025030Z 2025-05-07T19:51:57.0025479Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:51:57.0026071Z 2025-05-07T19:51:57.0027662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0030406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0031467Z ^ 2025-05-07T19:51:57.0031818Z 2025-05-07T19:51:57.0033393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0035932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0037045Z ^ 2025-05-07T19:51:57.0037284Z 2025-05-07T19:51:57.0037673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:57.0038257Z 2025-05-07T19:51:57.0039840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0042578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0043780Z ^ 2025-05-07T19:51:57.0044396Z 2025-05-07T19:51:57.0045899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0048555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0049676Z ^ 2025-05-07T19:51:57.0049956Z 2025-05-07T19:51:57.0050396Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:57.0051039Z 2025-05-07T19:51:57.0052664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:57.0055090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:57.0056193Z ^ 2025-05-07T19:51:57.0056550Z 2025-05-07T19:51:58.1864269Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:51:58.1892940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1896691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1898226Z ^ 2025-05-07T19:51:58.1898556Z 2025-05-07T19:51:58.1899099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1899958Z 2025-05-07T19:51:58.1902375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1906120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1907699Z ^ 2025-05-07T19:51:58.1908153Z 2025-05-07T19:51:58.1910214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1913481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1914933Z ^ 2025-05-07T19:51:58.1915278Z 2025-05-07T19:51:58.1915844Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1916664Z 2025-05-07T19:51:58.1918694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1922125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1923662Z ^ 2025-05-07T19:51:58.1924131Z 2025-05-07T19:51:58.1926355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1929909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1931754Z ^ 2025-05-07T19:51:58.1932089Z 2025-05-07T19:51:58.1932625Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:51:58.1933422Z 2025-05-07T19:51:58.1935464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1938804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1940303Z ^ 2025-05-07T19:51:58.1940876Z 2025-05-07T19:51:58.1942899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1946188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1947633Z ^ 2025-05-07T19:51:58.1947951Z 2025-05-07T19:51:58.1948491Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1949350Z 2025-05-07T19:51:58.1951332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1954231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1955104Z ^ 2025-05-07T19:51:58.1955411Z 2025-05-07T19:51:58.1956684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1958660Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1959541Z ^ 2025-05-07T19:51:58.1959735Z 2025-05-07T19:51:58.1960066Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:51:58.1960569Z 2025-05-07T19:51:58.1961730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:51:58.1963654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:51:58.1964506Z ^ 2025-05-07T19:51:58.1964799Z 2025-05-07T19:52:02.6749766Z [116/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:52:02.6766301Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:03.3425687Z [117/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o 
CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T19:52:03.7078213Z [118/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:52:03.7099030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7101545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7102568Z ^ 2025-05-07T19:52:03.7102817Z 2025-05-07T19:52:03.7103251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:03.7103850Z 2025-05-07T19:52:03.7105317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7107643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7108709Z ^ 2025-05-07T19:52:03.7109058Z 2025-05-07T19:52:03.7110504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7112856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7113912Z ^ 2025-05-07T19:52:03.7114147Z 2025-05-07T19:52:03.7114546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:03.7115168Z 2025-05-07T19:52:03.7116985Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7119302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7120328Z ^ 2025-05-07T19:52:03.7120649Z 2025-05-07T19:52:03.7122052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7124422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7125434Z ^ 2025-05-07T19:52:03.7125695Z 2025-05-07T19:52:03.7126091Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:03.7126657Z 2025-05-07T19:52:03.7128119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7130491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7131521Z ^ 2025-05-07T19:52:03.7131849Z 2025-05-07T19:52:03.7133536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7135908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7136962Z ^ 
2025-05-07T19:52:03.7137185Z 2025-05-07T19:52:03.7137583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:03.7138163Z 2025-05-07T19:52:03.7139593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7142154Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7143218Z ^ 2025-05-07T19:52:03.7143558Z 2025-05-07T19:52:03.7144945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7147350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7148365Z ^ 2025-05-07T19:52:03.7148598Z 2025-05-07T19:52:03.7148984Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:03.7149566Z 2025-05-07T19:52:03.7151034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:03.7153264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:03.7154315Z ^ 2025-05-07T19:52:03.7154642Z 2025-05-07T19:52:04.0159236Z [119/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T19:52:06.0008469Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:52:06.0030038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0032453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0033459Z ^ 2025-05-07T19:52:06.0033729Z 2025-05-07T19:52:06.0034154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0034721Z 2025-05-07T19:52:06.0036238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0038991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0040116Z ^ 2025-05-07T19:52:06.0040456Z 
2025-05-07T19:52:06.0042009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0044462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0045501Z ^ 2025-05-07T19:52:06.0045711Z 2025-05-07T19:52:06.0046125Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0046733Z 2025-05-07T19:52:06.0048322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0050718Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0051843Z ^ 2025-05-07T19:52:06.0052191Z 2025-05-07T19:52:06.0053972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0056529Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0057606Z ^ 2025-05-07T19:52:06.0057864Z 2025-05-07T19:52:06.0058317Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0058981Z 2025-05-07T19:52:06.0060503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0063580Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0064695Z ^ 2025-05-07T19:52:06.0065027Z 2025-05-07T19:52:06.0066574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0069055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0070189Z ^ 2025-05-07T19:52:06.0070440Z 2025-05-07T19:52:06.0070871Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0071491Z 2025-05-07T19:52:06.0083998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0086483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0087480Z ^ 2025-05-07T19:52:06.0087839Z 2025-05-07T19:52:06.0089277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0092210Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0093260Z ^ 2025-05-07T19:52:06.0093502Z 2025-05-07T19:52:06.0093933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:06.0094626Z 2025-05-07T19:52:06.0096122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:06.0098868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:06.0100055Z ^ 2025-05-07T19:52:06.0100409Z 2025-05-07T19:52:08.3710984Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:52:08.3732764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3735278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3736211Z ^ 2025-05-07T19:52:08.3736420Z 2025-05-07T19:52:08.3737137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.3737748Z 2025-05-07T19:52:08.3739279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3741910Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3743041Z ^ 2025-05-07T19:52:08.3743373Z 2025-05-07T19:52:08.3744908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3747442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3748491Z ^ 2025-05-07T19:52:08.3748735Z 2025-05-07T19:52:08.3749155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.3749756Z 2025-05-07T19:52:08.3751283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3753903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3754987Z ^ 2025-05-07T19:52:08.3755304Z 2025-05-07T19:52:08.3756751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3759116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3760172Z ^ 2025-05-07T19:52:08.3760633Z 2025-05-07T19:52:08.3761058Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.3761657Z 2025-05-07T19:52:08.3763165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3765641Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3766739Z ^ 2025-05-07T19:52:08.3767086Z 2025-05-07T19:52:08.3768659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3771133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3772288Z ^ 2025-05-07T19:52:08.3772532Z 2025-05-07T19:52:08.3772932Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:08.3773546Z 2025-05-07T19:52:08.3775058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3778109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3779521Z ^ 2025-05-07T19:52:08.3779917Z 2025-05-07T19:52:08.3781519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3784065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3785178Z ^ 2025-05-07T19:52:08.3785444Z 2025-05-07T19:52:08.3785883Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:52:08.3786532Z 2025-05-07T19:52:08.3788063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:08.3790358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:08.3791398Z ^ 2025-05-07T19:52:08.3791684Z 2025-05-07T19:52:11.6865537Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:52:11.6887007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6890164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6891319Z ^ 2025-05-07T19:52:11.6891603Z 2025-05-07T19:52:11.6892041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.6892675Z 2025-05-07T19:52:11.6894162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6896869Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6897833Z ^ 2025-05-07T19:52:11.6898128Z 2025-05-07T19:52:11.6899386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6901646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6902532Z ^ 2025-05-07T19:52:11.6902774Z 2025-05-07T19:52:11.6903165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.6903750Z 2025-05-07T19:52:11.6905347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6907951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6909227Z ^ 2025-05-07T19:52:11.6909549Z 2025-05-07T19:52:11.6910937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6913413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6914886Z ^ 2025-05-07T19:52:11.6915124Z 2025-05-07T19:52:11.6915556Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.6916190Z 2025-05-07T19:52:11.6917669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6919884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6920962Z ^ 2025-05-07T19:52:11.6921298Z 2025-05-07T19:52:11.6922615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6924881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6925954Z ^ 2025-05-07T19:52:11.6926179Z 2025-05-07T19:52:11.6926562Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:11.6927161Z 2025-05-07T19:52:11.6928783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6931274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6932247Z ^ 2025-05-07T19:52:11.6932544Z 2025-05-07T19:52:11.6933836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6935869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6936889Z ^ 2025-05-07T19:52:11.6937118Z 2025-05-07T19:52:11.6937501Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:52:11.6938010Z 2025-05-07T19:52:11.6939487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:11.6942010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:11.6943156Z ^ 2025-05-07T19:52:11.6943495Z 2025-05-07T19:52:12.6115668Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:52:12.6139006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6142002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6143059Z ^ 2025-05-07T19:52:12.6143301Z 2025-05-07T19:52:12.6143650Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6144207Z 2025-05-07T19:52:12.6145692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:52:12.6148140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6149330Z ^ 2025-05-07T19:52:12.6149703Z 2025-05-07T19:52:12.6151325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6153774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6154838Z ^ 2025-05-07T19:52:12.6155044Z 2025-05-07T19:52:12.6155395Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6155939Z 2025-05-07T19:52:12.6157423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6159924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6160986Z ^ 2025-05-07T19:52:12.6161581Z 2025-05-07T19:52:12.6163115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6165499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6166606Z ^ 2025-05-07T19:52:12.6166839Z 2025-05-07T19:52:12.6167225Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6167827Z 2025-05-07T19:52:12.6169268Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6171705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6172844Z ^ 2025-05-07T19:52:12.6173218Z 2025-05-07T19:52:12.6174597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6177259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6178316Z ^ 2025-05-07T19:52:12.6178576Z 2025-05-07T19:52:12.6178938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6179794Z 2025-05-07T19:52:12.6181415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6183757Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6184843Z ^ 2025-05-07T19:52:12.6185178Z 2025-05-07T19:52:12.6186711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6189125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6190190Z ^ 
2025-05-07T19:52:12.6190422Z 2025-05-07T19:52:12.6190807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:12.6191356Z 2025-05-07T19:52:12.6192720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:12.6195114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:12.6196209Z ^ 2025-05-07T19:52:12.6196568Z 2025-05-07T19:52:13.1106549Z [124/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:52:13.1128101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1130503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1131534Z ^ 2025-05-07T19:52:13.1131782Z 2025-05-07T19:52:13.1132309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.1132923Z 2025-05-07T19:52:13.1134481Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1137060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1138157Z ^ 2025-05-07T19:52:13.1138483Z 2025-05-07T19:52:13.1139999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1142603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1143683Z ^ 2025-05-07T19:52:13.1143923Z 2025-05-07T19:52:13.1144336Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.1144963Z 2025-05-07T19:52:13.1146487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1149239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1150341Z ^ 2025-05-07T19:52:13.1150708Z 2025-05-07T19:52:13.1152207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1154588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1155671Z ^ 
2025-05-07T19:52:13.1155931Z 2025-05-07T19:52:13.1156353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.1156983Z 2025-05-07T19:52:13.1158524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1160996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1162036Z ^ 2025-05-07T19:52:13.1162377Z 2025-05-07T19:52:13.1163865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1166513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1167625Z ^ 2025-05-07T19:52:13.1167857Z 2025-05-07T19:52:13.1168277Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.1168904Z 2025-05-07T19:52:13.1170394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1172940Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1174007Z ^ 2025-05-07T19:52:13.1174354Z 2025-05-07T19:52:13.1176287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:13.1178633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1179626Z ^ 2025-05-07T19:52:13.1179845Z 2025-05-07T19:52:13.1180257Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:13.1180924Z 2025-05-07T19:52:13.1182400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:13.1184793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:13.1185851Z ^ 2025-05-07T19:52:13.1186181Z 2025-05-07T19:52:14.0367303Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:52:14.0378984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0380384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0381101Z ^ 2025-05-07T19:52:14.0381262Z 2025-05-07T19:52:14.0381503Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:14.0381854Z 2025-05-07T19:52:14.0382738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation 
is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0384115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0384749Z ^ 2025-05-07T19:52:14.0384945Z 2025-05-07T19:52:14.0385792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0387163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0387784Z ^ 2025-05-07T19:52:14.0387923Z 2025-05-07T19:52:14.0388166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:14.0388529Z 2025-05-07T19:52:14.0389385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0390974Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0391587Z ^ 2025-05-07T19:52:14.0391800Z 2025-05-07T19:52:14.0392642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0394017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0394622Z ^ 2025-05-07T19:52:14.0394762Z 2025-05-07T19:52:14.0395009Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:52:14.0395360Z 2025-05-07T19:52:14.0396211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0397759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0398389Z ^ 2025-05-07T19:52:14.0398585Z 2025-05-07T19:52:14.0399542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0400921Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0401541Z ^ 2025-05-07T19:52:14.0401683Z 2025-05-07T19:52:14.0401917Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:14.0402261Z 2025-05-07T19:52:14.0403132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0404496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0405127Z ^ 2025-05-07T19:52:14.0405327Z 2025-05-07T19:52:14.0406187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0407544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0408167Z ^ 2025-05-07T19:52:14.0408303Z 2025-05-07T19:52:14.0408551Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:14.0408895Z 2025-05-07T19:52:14.0409750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:14.0411131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:14.0411750Z ^ 2025-05-07T19:52:14.0411958Z 2025-05-07T19:52:44.4307805Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:52:44.4332213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4334998Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4336306Z ^ 2025-05-07T19:52:44.4336634Z 2025-05-07T19:52:44.4337127Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.4337769Z 2025-05-07T19:52:44.4339560Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4342508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4343806Z ^ 2025-05-07T19:52:44.4344225Z 2025-05-07T19:52:44.4346113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4349128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4350451Z ^ 2025-05-07T19:52:44.4350684Z 2025-05-07T19:52:44.4351109Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.4352133Z 2025-05-07T19:52:44.4353823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4356930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4358062Z ^ 2025-05-07T19:52:44.4358470Z 2025-05-07T19:52:44.4360406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4363286Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4364598Z ^ 
2025-05-07T19:52:44.4364878Z 2025-05-07T19:52:44.4365357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.4366097Z 2025-05-07T19:52:44.4367870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4370769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4372048Z ^ 2025-05-07T19:52:44.4372448Z 2025-05-07T19:52:44.4374487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4377857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4379181Z ^ 2025-05-07T19:52:44.4379469Z 2025-05-07T19:52:44.4379979Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.4380981Z 2025-05-07T19:52:44.4382816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4385772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4387130Z ^ 2025-05-07T19:52:44.4387543Z 2025-05-07T19:52:44.4389355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:44.4392334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4393662Z ^ 2025-05-07T19:52:44.4393947Z 2025-05-07T19:52:44.4394443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:44.4395172Z 2025-05-07T19:52:44.4397014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:44.4399924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:44.4401221Z ^ 2025-05-07T19:52:44.4402014Z 2025-05-07T19:52:45.1111317Z [127/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 
CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:46.3269732Z [128/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:46.3295564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3298537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3299904Z ^ 2025-05-07T19:52:46.3300361Z 2025-05-07T19:52:46.3300983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.3301710Z 2025-05-07T19:52:46.3303519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3306419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3307727Z ^ 2025-05-07T19:52:46.3308139Z 2025-05-07T19:52:46.3309905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3312789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3314088Z ^ 2025-05-07T19:52:46.3314372Z 2025-05-07T19:52:46.3314853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.3315572Z 2025-05-07T19:52:46.3317387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3320269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3321942Z ^ 2025-05-07T19:52:46.3322305Z 2025-05-07T19:52:46.3324080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3327055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3328384Z ^ 2025-05-07T19:52:46.3328674Z 2025-05-07T19:52:46.3329196Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.3329931Z 2025-05-07T19:52:46.3331844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3334828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3336309Z ^ 2025-05-07T19:52:46.3336727Z 2025-05-07T19:52:46.3338450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3341290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3342992Z ^ 2025-05-07T19:52:46.3343293Z 2025-05-07T19:52:46.3343775Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.3344487Z 2025-05-07T19:52:46.3346293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3349179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3350485Z ^ 2025-05-07T19:52:46.3350884Z 2025-05-07T19:52:46.3352653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3355519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3356995Z ^ 2025-05-07T19:52:46.3357278Z 2025-05-07T19:52:46.3357782Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:52:46.3358539Z 2025-05-07T19:52:46.3360374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.3363370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.3364708Z ^ 2025-05-07T19:52:46.3365139Z 2025-05-07T19:52:46.6537669Z [129/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:52:46.6562709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6565644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6566928Z ^ 2025-05-07T19:52:46.6567230Z 2025-05-07T19:52:46.6567724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.6568494Z 2025-05-07T19:52:46.6570309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:52:46.6573230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6574543Z ^ 2025-05-07T19:52:46.6574947Z 2025-05-07T19:52:46.6577174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6580038Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6581417Z ^ 2025-05-07T19:52:46.6581736Z 2025-05-07T19:52:46.6582221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.6582958Z 2025-05-07T19:52:46.6584664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6587898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6589189Z ^ 2025-05-07T19:52:46.6589602Z 2025-05-07T19:52:46.6591322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6594125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6595114Z ^ 2025-05-07T19:52:46.6595416Z 2025-05-07T19:52:46.6595896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.6596626Z 2025-05-07T19:52:46.6598346Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6601137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6602396Z ^ 2025-05-07T19:52:46.6602794Z 2025-05-07T19:52:46.6604934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6607613Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6608920Z ^ 2025-05-07T19:52:46.6609205Z 2025-05-07T19:52:46.6609698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.6610408Z 2025-05-07T19:52:46.6612188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6614932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6616240Z ^ 2025-05-07T19:52:46.6616634Z 2025-05-07T19:52:46.6618384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6621347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6622638Z ^ 
2025-05-07T19:52:46.6622915Z 2025-05-07T19:52:46.6623392Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:46.6624050Z 2025-05-07T19:52:46.6625755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:46.6628671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:46.6629814Z ^ 2025-05-07T19:52:46.6630214Z 2025-05-07T19:52:47.2558033Z [130/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:52:57.5378663Z [131/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:52:57.5398904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5401469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5402533Z ^ 2025-05-07T19:52:57.5402801Z 
2025-05-07T19:52:57.5403207Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:57.5403802Z 2025-05-07T19:52:57.5405291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5407591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5408661Z ^ 2025-05-07T19:52:57.5408980Z 2025-05-07T19:52:57.5410896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5413370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5414464Z ^ 2025-05-07T19:52:57.5414705Z 2025-05-07T19:52:57.5415252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:57.5415867Z 2025-05-07T19:52:57.5417434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5419829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5421073Z ^ 2025-05-07T19:52:57.5421438Z 2025-05-07T19:52:57.5422876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5425238Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5426273Z ^ 2025-05-07T19:52:57.5426533Z 2025-05-07T19:52:57.5426935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:57.5427580Z 2025-05-07T19:52:57.5429048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5431534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5432632Z ^ 2025-05-07T19:52:57.5433281Z 2025-05-07T19:52:57.5434794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5437159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5438174Z ^ 2025-05-07T19:52:57.5438428Z 2025-05-07T19:52:57.5438875Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:57.5439436Z 2025-05-07T19:52:57.5440956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5443351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5444502Z ^ 2025-05-07T19:52:57.5445009Z 2025-05-07T19:52:57.5446497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5448949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5450045Z ^ 2025-05-07T19:52:57.5450271Z 2025-05-07T19:52:57.5450640Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:57.5451644Z 2025-05-07T19:52:57.5453138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:57.5455557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:57.5456634Z ^ 2025-05-07T19:52:57.5457005Z 2025-05-07T19:52:59.4898172Z [132/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:52:59.4919011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4921469Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4922644Z ^ 2025-05-07T19:52:59.4922885Z 2025-05-07T19:52:59.4923292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.4923918Z 2025-05-07T19:52:59.4925350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4927997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4929027Z ^ 2025-05-07T19:52:59.4929375Z 2025-05-07T19:52:59.4930798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4933083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4934096Z ^ 2025-05-07T19:52:59.4934351Z 2025-05-07T19:52:59.4934742Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.4935323Z 2025-05-07T19:52:59.4936765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4939069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4940176Z ^ 2025-05-07T19:52:59.4940488Z 2025-05-07T19:52:59.4942006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4944325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4945383Z ^ 2025-05-07T19:52:59.4945619Z 2025-05-07T19:52:59.4945998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.4946610Z 2025-05-07T19:52:59.4948038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4950647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4951689Z ^ 2025-05-07T19:52:59.4952028Z 2025-05-07T19:52:59.4953423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4955586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4956608Z ^ 2025-05-07T19:52:59.4956861Z 2025-05-07T19:52:59.4957265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.4957833Z 2025-05-07T19:52:59.4959257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4961579Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:52:59.4962625Z ^ 2025-05-07T19:52:59.4962960Z 2025-05-07T19:52:59.4964356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4966895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4967965Z ^ 2025-05-07T19:52:59.4968222Z 2025-05-07T19:52:59.4968603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:52:59.4969174Z 2025-05-07T19:52:59.4970615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:52:59.4972887Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:52:59.4973948Z ^ 2025-05-07T19:52:59.4974263Z 2025-05-07T19:53:02.1475617Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:53:02.1499714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:53:02.1502568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1503757Z ^ 2025-05-07T19:53:02.1504036Z 2025-05-07T19:53:02.1504473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.1505201Z 2025-05-07T19:53:02.1507208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1509981Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1511168Z ^ 2025-05-07T19:53:02.1511559Z 2025-05-07T19:53:02.1513334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1516065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1517171Z ^ 2025-05-07T19:53:02.1517452Z 2025-05-07T19:53:02.1517902Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.1518563Z 2025-05-07T19:53:02.1520202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1522834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1524022Z ^ 2025-05-07T19:53:02.1524398Z 2025-05-07T19:53:02.1526047Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1528669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1529842Z ^ 2025-05-07T19:53:02.1530094Z 2025-05-07T19:53:02.1530535Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.1531899Z 2025-05-07T19:53:02.1533553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1536297Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1537474Z ^ 2025-05-07T19:53:02.1537876Z 2025-05-07T19:53:02.1539545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1542466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1543760Z ^ 2025-05-07T19:53:02.1544039Z 2025-05-07T19:53:02.1544488Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.1545169Z 2025-05-07T19:53:02.1546806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1549416Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1550626Z ^ 2025-05-07T19:53:02.1550988Z 2025-05-07T19:53:02.1552781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1555625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1556890Z ^ 2025-05-07T19:53:02.1557129Z 2025-05-07T19:53:02.1557560Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:02.1558270Z 2025-05-07T19:53:02.1559927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:02.1562709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:02.1563861Z ^ 2025-05-07T19:53:02.1564273Z 2025-05-07T19:53:05.1912226Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:05.1934082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1937159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1938311Z ^ 2025-05-07T19:53:05.1938564Z 2025-05-07T19:53:05.1938969Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.1939666Z 2025-05-07T19:53:05.1941334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1943858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1945003Z ^ 2025-05-07T19:53:05.1945398Z 2025-05-07T19:53:05.1946971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1949514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1950634Z ^ 2025-05-07T19:53:05.1950939Z 2025-05-07T19:53:05.1951372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.1952007Z 2025-05-07T19:53:05.1953628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1956010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:53:05.1957152Z ^ 2025-05-07T19:53:05.1957489Z 2025-05-07T19:53:05.1958984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1961354Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1962771Z ^ 2025-05-07T19:53:05.1963022Z 2025-05-07T19:53:05.1963448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.1964109Z 2025-05-07T19:53:05.1965632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1968100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1969255Z ^ 2025-05-07T19:53:05.1969659Z 2025-05-07T19:53:05.1971099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1973527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1974615Z ^ 2025-05-07T19:53:05.1974855Z 2025-05-07T19:53:05.1975327Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.1976259Z 2025-05-07T19:53:05.1977875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:05.1980837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1981995Z ^ 2025-05-07T19:53:05.1982343Z 2025-05-07T19:53:05.1983855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1986380Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1987525Z ^ 2025-05-07T19:53:05.1987764Z 2025-05-07T19:53:05.1988191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.1988824Z 2025-05-07T19:53:05.1990381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.1992816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.1993889Z ^ 2025-05-07T19:53:05.1994223Z 2025-05-07T19:53:05.7760560Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 
2025-05-07T19:53:05.7784222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7786841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7787964Z ^ 2025-05-07T19:53:05.7788235Z 2025-05-07T19:53:05.7788644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.7789203Z 2025-05-07T19:53:05.7790698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7793188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7794367Z ^ 2025-05-07T19:53:05.7794738Z 2025-05-07T19:53:05.7796458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7798904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7799958Z ^ 2025-05-07T19:53:05.7800203Z 2025-05-07T19:53:05.7800614Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.7801256Z 2025-05-07T19:53:05.7802728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7805132Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7806184Z ^ 2025-05-07T19:53:05.7806575Z 2025-05-07T19:53:05.7808111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7811047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7812205Z ^ 2025-05-07T19:53:05.7812494Z 2025-05-07T19:53:05.7812923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.7813529Z 2025-05-07T19:53:05.7815060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7817493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7818698Z ^ 2025-05-07T19:53:05.7819054Z 2025-05-07T19:53:05.7820500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7823077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7824156Z ^ 2025-05-07T19:53:05.7824394Z 2025-05-07T19:53:05.7824826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.7825469Z 2025-05-07T19:53:05.7827231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7829725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7830804Z ^ 2025-05-07T19:53:05.7831191Z 2025-05-07T19:53:05.7832691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7835246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7836356Z ^ 2025-05-07T19:53:05.7836600Z 2025-05-07T19:53:05.7837075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:05.7837683Z 2025-05-07T19:53:05.7839181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:05.7841663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:05.7842802Z ^ 2025-05-07T19:53:05.7843160Z 2025-05-07T19:53:14.6571528Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:53:14.6595006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6597903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6599117Z ^ 2025-05-07T19:53:14.6599385Z 2025-05-07T19:53:14.6599845Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6600561Z 2025-05-07T19:53:14.6602318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6605073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6606331Z ^ 2025-05-07T19:53:14.6606728Z 2025-05-07T19:53:14.6608416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6611034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6612236Z ^ 2025-05-07T19:53:14.6612524Z 2025-05-07T19:53:14.6612988Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6613630Z 2025-05-07T19:53:14.6615336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6618116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6619714Z ^ 2025-05-07T19:53:14.6620096Z 2025-05-07T19:53:14.6621925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6624677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6625842Z ^ 2025-05-07T19:53:14.6626113Z 2025-05-07T19:53:14.6626594Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6627325Z 2025-05-07T19:53:14.6629005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6631688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6632898Z ^ 2025-05-07T19:53:14.6633319Z 2025-05-07T19:53:14.6635010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6638065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6639229Z ^ 2025-05-07T19:53:14.6639533Z 2025-05-07T19:53:14.6640005Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:53:14.6640672Z 2025-05-07T19:53:14.6642344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6645113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6646329Z ^ 2025-05-07T19:53:14.6646707Z 2025-05-07T19:53:14.6648389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6651148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6652365Z ^ 2025-05-07T19:53:14.6652640Z 2025-05-07T19:53:14.6653106Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:14.6653784Z 2025-05-07T19:53:14.6655490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.6658197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:14.6659432Z ^ 2025-05-07T19:53:14.6659817Z 2025-05-07T19:53:19.2239867Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:19.2263309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2266073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2267251Z ^ 2025-05-07T19:53:19.2267466Z 2025-05-07T19:53:19.2267876Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.2268479Z 2025-05-07T19:53:19.2269954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2272451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2273545Z ^ 2025-05-07T19:53:19.2273896Z 2025-05-07T19:53:19.2275369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2277536Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2278052Z ^ 2025-05-07T19:53:19.2278341Z 2025-05-07T19:53:19.2279791Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2281973Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2282478Z ^ 2025-05-07T19:53:19.2282756Z 2025-05-07T19:53:19.2284214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2286045Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2286532Z ^ 2025-05-07T19:53:19.2286801Z 2025-05-07T19:53:19.2288467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2291158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2292363Z ^ 2025-05-07T19:53:19.2292614Z 2025-05-07T19:53:19.2293222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.2293893Z 2025-05-07T19:53:19.2295461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2298057Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2299499Z ^ 2025-05-07T19:53:19.2299860Z 2025-05-07T19:53:19.2301447Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2303343Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2303880Z ^ 2025-05-07T19:53:19.2304171Z 2025-05-07T19:53:19.2305664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2307559Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2308080Z ^ 2025-05-07T19:53:19.2308357Z 2025-05-07T19:53:19.2309866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2311750Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2312413Z ^ 2025-05-07T19:53:19.2312700Z 2025-05-07T19:53:19.2314298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2317062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2318289Z ^ 2025-05-07T19:53:19.2318538Z 2025-05-07T19:53:19.2318973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.2319635Z 2025-05-07T19:53:19.2321242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:19.2324029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2325168Z ^ 2025-05-07T19:53:19.2325552Z 2025-05-07T19:53:19.2326980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2328863Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2329430Z ^ 2025-05-07T19:53:19.2329735Z 2025-05-07T19:53:19.2331361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2333332Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2333930Z ^ 2025-05-07T19:53:19.2334227Z 2025-05-07T19:53:19.2335832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2337875Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2338381Z ^ 2025-05-07T19:53:19.2338665Z 2025-05-07T19:53:19.2340133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2342834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2343868Z ^ 2025-05-07T19:53:19.2344099Z 2025-05-07T19:53:19.2344523Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.2345175Z 
2025-05-07T19:53:19.2346779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2349351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2350472Z ^ 2025-05-07T19:53:19.2350828Z 2025-05-07T19:53:19.2352527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2354526Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2355100Z ^ 2025-05-07T19:53:19.2355389Z 2025-05-07T19:53:19.2356890Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2358845Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2359410Z ^ 2025-05-07T19:53:19.2359702Z 2025-05-07T19:53:19.2361291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2363380Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2364001Z ^ 2025-05-07T19:53:19.2364322Z 2025-05-07T19:53:19.2366039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2369024Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2370061Z ^ 2025-05-07T19:53:19.2370300Z 2025-05-07T19:53:19.2370681Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:19.2371285Z 2025-05-07T19:53:19.2372719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2375060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:19.2376379Z ^ 2025-05-07T19:53:19.2376752Z 2025-05-07T19:53:19.2378350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2380306Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2380978Z ^ 2025-05-07T19:53:19.2381275Z 2025-05-07T19:53:19.2382695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2387126Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2387794Z ^ 2025-05-07T19:53:19.2388065Z 2025-05-07T19:53:19.2389438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:19.2391420Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:19.2392022Z ^ 2025-05-07T19:53:19.2392332Z 2025-05-07T19:53:26.2460031Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:26.2483833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2486418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2487587Z ^ 2025-05-07T19:53:26.2488071Z 2025-05-07T19:53:26.2488466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.2489284Z 2025-05-07T19:53:26.2490841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2493924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2495111Z ^ 2025-05-07T19:53:26.2495469Z 2025-05-07T19:53:26.2497105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2499686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2500978Z ^ 2025-05-07T19:53:26.2501627Z 2025-05-07T19:53:26.2502102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.2502786Z 2025-05-07T19:53:26.2504442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2506920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2508008Z ^ 2025-05-07T19:53:26.2508383Z 2025-05-07T19:53:26.2510028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2512705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2513874Z ^ 2025-05-07T19:53:26.2514133Z 2025-05-07T19:53:26.2514528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.2515185Z 2025-05-07T19:53:26.2516894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2520013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2521222Z ^ 2025-05-07T19:53:26.2521593Z 2025-05-07T19:53:26.2523215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2525936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2527148Z ^ 2025-05-07T19:53:26.2527406Z 2025-05-07T19:53:26.2527900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.2528556Z 2025-05-07T19:53:26.2530285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2532873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2534019Z ^ 2025-05-07T19:53:26.2534415Z 2025-05-07T19:53:26.2536009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2538883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2540033Z ^ 2025-05-07T19:53:26.2540316Z 2025-05-07T19:53:26.2540915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:26.2541576Z 2025-05-07T19:53:26.2543187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.2545804Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:26.2546960Z ^ 
2025-05-07T19:53:26.2547294Z 2025-05-07T19:53:27.0632135Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:27.0655002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0657752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0658954Z ^ 2025-05-07T19:53:27.0659216Z 2025-05-07T19:53:27.0659676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.0660343Z 2025-05-07T19:53:27.0662662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0665355Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0666549Z ^ 2025-05-07T19:53:27.0666915Z 2025-05-07T19:53:27.0668537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:27.0671092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0672150Z ^ 2025-05-07T19:53:27.0672363Z 2025-05-07T19:53:27.0672764Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.0673406Z 2025-05-07T19:53:27.0675302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0678081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0679212Z ^ 2025-05-07T19:53:27.0679608Z 2025-05-07T19:53:27.0681144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0683639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0684750Z ^ 2025-05-07T19:53:27.0685024Z 2025-05-07T19:53:27.0685457Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.0686510Z 2025-05-07T19:53:27.0688092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0690632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0691950Z ^ 2025-05-07T19:53:27.0692314Z 2025-05-07T19:53:27.0693944Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0696554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0697683Z ^ 2025-05-07T19:53:27.0697930Z 2025-05-07T19:53:27.0698356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.0698989Z 2025-05-07T19:53:27.0700713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0703262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0704391Z ^ 2025-05-07T19:53:27.0704743Z 2025-05-07T19:53:27.0706554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0708889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0709918Z ^ 2025-05-07T19:53:27.0710148Z 2025-05-07T19:53:27.0710585Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:27.0711261Z 2025-05-07T19:53:27.0712821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.0715155Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:27.0716345Z ^ 2025-05-07T19:53:27.0716716Z 2025-05-07T19:53:28.7869845Z [140/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:28.7897837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7901421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.7902775Z ^ 2025-05-07T19:53:28.7903085Z 2025-05-07T19:53:28.7903517Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.7904168Z 2025-05-07T19:53:28.7906031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7908836Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.7910110Z ^ 2025-05-07T19:53:28.7910478Z 2025-05-07T19:53:28.7912212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7914593Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.7915478Z ^ 2025-05-07T19:53:28.7915809Z 2025-05-07T19:53:28.7917534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7919721Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7920371Z ^ 2025-05-07T19:53:28.7920694Z 2025-05-07T19:53:28.7922393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7924604Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7925230Z ^ 2025-05-07T19:53:28.7925562Z 2025-05-07T19:53:28.7927286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7929962Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7930554Z ^ 2025-05-07T19:53:28.7930878Z 2025-05-07T19:53:28.7932643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:53:28.7935062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.7936311Z ^ 2025-05-07T19:53:28.7936538Z 2025-05-07T19:53:28.7937009Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.7937667Z 2025-05-07T19:53:28.7939385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7942703Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.7943866Z ^ 2025-05-07T19:53:28.7944276Z 2025-05-07T19:53:28.7945941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7948558Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.7949374Z ^ 2025-05-07T19:53:28.7949611Z 2025-05-07T19:53:28.7951312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7953405Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7954034Z ^ 2025-05-07T19:53:28.7954374Z 2025-05-07T19:53:28.7956072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7958173Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7958798Z ^ 
2025-05-07T19:53:28.7959128Z 2025-05-07T19:53:28.7960649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7962770Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7963378Z ^ 2025-05-07T19:53:28.7963690Z 2025-05-07T19:53:28.7965524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7968342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.7969654Z ^ 2025-05-07T19:53:28.7969940Z 2025-05-07T19:53:28.7970443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.7971182Z 2025-05-07T19:53:28.7973036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7976289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.7978010Z ^ 2025-05-07T19:53:28.7978434Z 2025-05-07T19:53:28.7980147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7982681Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.7983521Z ^ 2025-05-07T19:53:28.7983864Z 2025-05-07T19:53:28.7985588Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7987789Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7988436Z ^ 2025-05-07T19:53:28.7988754Z 2025-05-07T19:53:28.7990493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7992654Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7993302Z ^ 2025-05-07T19:53:28.7993614Z 2025-05-07T19:53:28.7995303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.7997736Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.7998377Z ^ 2025-05-07T19:53:28.7998673Z 2025-05-07T19:53:28.8000485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.8003480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.8004814Z ^ 2025-05-07T19:53:28.8005098Z 2025-05-07T19:53:28.8005591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.8006319Z 2025-05-07T19:53:28.8008153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.8011110Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.8012443Z ^ 2025-05-07T19:53:28.8012884Z 2025-05-07T19:53:28.8014594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8016939Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.8017785Z ^ 2025-05-07T19:53:28.8018105Z 2025-05-07T19:53:28.8019846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8022142Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.8022785Z ^ 2025-05-07T19:53:28.8023110Z 2025-05-07T19:53:28.8024778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8028467Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.8029119Z ^ 2025-05-07T19:53:28.8029434Z 2025-05-07T19:53:28.8031113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8045119Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.8045813Z ^ 2025-05-07T19:53:28.8046157Z 2025-05-07T19:53:28.8048012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:53:28.8051017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.8052310Z ^ 2025-05-07T19:53:28.8052626Z 2025-05-07T19:53:28.8053027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:28.8053686Z 2025-05-07T19:53:28.8055500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.8058498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:28.8060144Z ^ 2025-05-07T19:53:28.8060721Z 2025-05-07T19:53:28.8062452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8064847Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:28.8065707Z ^ 2025-05-07T19:53:28.8066035Z 2025-05-07T19:53:28.8067714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8069908Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.8070540Z ^ 2025-05-07T19:53:28.8070880Z 2025-05-07T19:53:28.8072601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8074796Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.8075432Z ^ 
2025-05-07T19:53:28.8075757Z 2025-05-07T19:53:28.8077782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:28.8079938Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:28.8080560Z ^ 2025-05-07T19:53:28.8080872Z 2025-05-07T19:53:29.9671610Z [141/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:29.9696984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9699836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9701263Z ^ 2025-05-07T19:53:29.9701552Z 2025-05-07T19:53:29.9702042Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.9702751Z 2025-05-07T19:53:29.9704781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9707597Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9708834Z ^ 2025-05-07T19:53:29.9709206Z 2025-05-07T19:53:29.9710780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9712992Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:29.9713753Z ^ 2025-05-07T19:53:29.9714062Z 2025-05-07T19:53:29.9715791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9718110Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9718949Z ^ 2025-05-07T19:53:29.9719244Z 2025-05-07T19:53:29.9720850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9722871Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9723433Z ^ 2025-05-07T19:53:29.9723745Z 2025-05-07T19:53:29.9725344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9727404Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9727970Z ^ 2025-05-07T19:53:29.9728276Z 2025-05-07T19:53:29.9729995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:29.9732800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9734032Z ^ 2025-05-07T19:53:29.9734287Z 2025-05-07T19:53:29.9734765Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.9735469Z 2025-05-07T19:53:29.9737428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9740405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9741645Z ^ 2025-05-07T19:53:29.9742042Z 2025-05-07T19:53:29.9743658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9745890Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:29.9746673Z ^ 2025-05-07T19:53:29.9746982Z 2025-05-07T19:53:29.9748570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9750620Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9751184Z ^ 2025-05-07T19:53:29.9751499Z 2025-05-07T19:53:29.9753096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9755131Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9755859Z ^ 2025-05-07T19:53:29.9756137Z 
2025-05-07T19:53:29.9757696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9759652Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9760202Z ^ 2025-05-07T19:53:29.9760479Z 2025-05-07T19:53:29.9762182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9764886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9766222Z ^ 2025-05-07T19:53:29.9766472Z 2025-05-07T19:53:29.9767102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.9767819Z 2025-05-07T19:53:29.9769544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9772349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9773588Z ^ 2025-05-07T19:53:29.9773992Z 2025-05-07T19:53:29.9775585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9778116Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:29.9778884Z ^ 2025-05-07T19:53:29.9779196Z 2025-05-07T19:53:29.9780903Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9782911Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9783495Z ^ 2025-05-07T19:53:29.9783790Z 2025-05-07T19:53:29.9785669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9787699Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9788276Z ^ 2025-05-07T19:53:29.9788557Z 2025-05-07T19:53:29.9790188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9792180Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9792755Z ^ 2025-05-07T19:53:29.9793044Z 2025-05-07T19:53:29.9794743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9797548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9798764Z ^ 2025-05-07T19:53:29.9799027Z 2025-05-07T19:53:29.9799503Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.9800203Z 2025-05-07T19:53:29.9801936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9804746Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9805993Z ^ 2025-05-07T19:53:29.9806370Z 2025-05-07T19:53:29.9807985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9810178Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:29.9811186Z ^ 2025-05-07T19:53:29.9811488Z 2025-05-07T19:53:29.9813094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9815130Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9815714Z ^ 2025-05-07T19:53:29.9816004Z 2025-05-07T19:53:29.9817595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9819624Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9820202Z ^ 2025-05-07T19:53:29.9820612Z 2025-05-07T19:53:29.9822201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9824226Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9824786Z ^ 2025-05-07T19:53:29.9825087Z 2025-05-07T19:53:29.9826797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:29.9829592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9830816Z ^ 2025-05-07T19:53:29.9831219Z 2025-05-07T19:53:29.9831676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:29.9832366Z 2025-05-07T19:53:29.9834138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:29.9836946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:29.9838186Z ^ 2025-05-07T19:53:29.9838568Z 2025-05-07T19:53:29.9840207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9842406Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:53:29.9843204Z ^ 2025-05-07T19:53:29.9843495Z 2025-05-07T19:53:29.9845081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9847117Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9847690Z ^ 2025-05-07T19:53:29.9847985Z 2025-05-07T19:53:29.9849606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9851615Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9852180Z ^ 2025-05-07T19:53:29.9852487Z 
2025-05-07T19:53:29.9854068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:29.9856115Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:53:29.9856823Z ^ 2025-05-07T19:53:29.9857121Z 2025-05-07T19:53:30.2322395Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:53:30.2343923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2346653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2347777Z ^ 2025-05-07T19:53:30.2348045Z 2025-05-07T19:53:30.2348472Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.2349154Z 2025-05-07T19:53:30.2350848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2353268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2354211Z ^ 2025-05-07T19:53:30.2354503Z 2025-05-07T19:53:30.2355632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2357432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2358334Z ^ 2025-05-07T19:53:30.2358965Z 2025-05-07T19:53:30.2359374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.2360001Z 2025-05-07T19:53:30.2361622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2364173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2365319Z ^ 2025-05-07T19:53:30.2365677Z 2025-05-07T19:53:30.2367002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2369108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2370065Z ^ 2025-05-07T19:53:30.2370276Z 2025-05-07T19:53:30.2370651Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.2371229Z 2025-05-07T19:53:30.2372997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2375727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:53:30.2377487Z ^ 2025-05-07T19:53:30.2377804Z 2025-05-07T19:53:30.2379105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2381576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2382686Z ^ 2025-05-07T19:53:30.2382939Z 2025-05-07T19:53:30.2383353Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.2383995Z 2025-05-07T19:53:30.2385648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2388321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2389479Z ^ 2025-05-07T19:53:30.2389845Z 2025-05-07T19:53:30.2391402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.2394022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2395215Z ^ 2025-05-07T19:53:30.2395470Z 2025-05-07T19:53:30.2395918Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:30.2396597Z 2025-05-07T19:53:30.2398266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:53:30.2400977Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:30.2402507Z ^ 2025-05-07T19:53:30.2402904Z 2025-05-07T19:53:30.8805356Z [143/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:31.4922455Z [144/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:33.1553835Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:53:33.1575112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1578248Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1579444Z ^ 2025-05-07T19:53:33.1579682Z 2025-05-07T19:53:33.1580099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.1581193Z 2025-05-07T19:53:33.1582788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1585335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1586422Z ^ 2025-05-07T19:53:33.1586758Z 2025-05-07T19:53:33.1588247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1590713Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1591954Z ^ 2025-05-07T19:53:33.1592192Z 2025-05-07T19:53:33.1592601Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.1593211Z 2025-05-07T19:53:33.1594747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1597374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:53:33.1598529Z ^ 2025-05-07T19:53:33.1598885Z 2025-05-07T19:53:33.1600829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1603417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1604490Z ^ 2025-05-07T19:53:33.1604743Z 2025-05-07T19:53:33.1605206Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.1605814Z 2025-05-07T19:53:33.1607290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1609762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1610844Z ^ 2025-05-07T19:53:33.1611191Z 2025-05-07T19:53:33.1612620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1615055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1616019Z ^ 2025-05-07T19:53:33.1616248Z 2025-05-07T19:53:33.1616665Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.1617235Z 2025-05-07T19:53:33.1618789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:53:33.1621454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1622621Z ^ 2025-05-07T19:53:33.1623218Z 2025-05-07T19:53:33.1624807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1627378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1628517Z ^ 2025-05-07T19:53:33.1628768Z 2025-05-07T19:53:33.1629189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:33.1629817Z 2025-05-07T19:53:33.1631398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:33.1633859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:33.1634970Z ^ 2025-05-07T19:53:33.1635328Z 2025-05-07T19:53:37.3356756Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:53:37.3381339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3384201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3385642Z ^ 2025-05-07T19:53:37.3385906Z 2025-05-07T19:53:37.3386370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:37.3387085Z 2025-05-07T19:53:37.3388804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3391607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3392857Z ^ 2025-05-07T19:53:37.3393257Z 2025-05-07T19:53:37.3394961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3397749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3398959Z ^ 2025-05-07T19:53:37.3399238Z 2025-05-07T19:53:37.3399697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:37.3400376Z 2025-05-07T19:53:37.3402124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3405148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3406376Z ^ 2025-05-07T19:53:37.3406760Z 2025-05-07T19:53:37.3408499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3411260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3412476Z ^ 2025-05-07T19:53:37.3412733Z 2025-05-07T19:53:37.3413191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:37.3413898Z 2025-05-07T19:53:37.3415649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3418456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3419684Z ^ 2025-05-07T19:53:37.3420204Z 2025-05-07T19:53:37.3421940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3424716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3425922Z ^ 2025-05-07T19:53:37.3426187Z 2025-05-07T19:53:37.3426668Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:53:37.3427368Z 2025-05-07T19:53:37.3429096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3432084Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3433339Z ^ 2025-05-07T19:53:37.3433710Z 2025-05-07T19:53:37.3435415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3438177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3439394Z ^ 2025-05-07T19:53:37.3439675Z 2025-05-07T19:53:37.3440131Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:37.3440817Z 2025-05-07T19:53:37.3442590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:37.3445376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:37.3446598Z ^ 2025-05-07T19:53:37.3446974Z 2025-05-07T19:53:39.2230217Z [147/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:53:39.2249591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2251777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2252703Z ^ 2025-05-07T19:53:39.2252915Z 2025-05-07T19:53:39.2253256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.2253787Z 2025-05-07T19:53:39.2255139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2257271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2258256Z ^ 2025-05-07T19:53:39.2258544Z 2025-05-07T19:53:39.2259846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2262099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2263057Z ^ 2025-05-07T19:53:39.2263256Z 2025-05-07T19:53:39.2263633Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:53:39.2264154Z 2025-05-07T19:53:39.2265792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2267934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2268894Z ^ 2025-05-07T19:53:39.2269183Z 2025-05-07T19:53:39.2270498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2272731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2273701Z ^ 2025-05-07T19:53:39.2273943Z 2025-05-07T19:53:39.2274314Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.2274867Z 2025-05-07T19:53:39.2276539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2278731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2279713Z ^ 2025-05-07T19:53:39.2280014Z 2025-05-07T19:53:39.2281362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2283527Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2284486Z ^ 2025-05-07T19:53:39.2284695Z 2025-05-07T19:53:39.2285062Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.2285995Z 2025-05-07T19:53:39.2287343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2289544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2290506Z ^ 2025-05-07T19:53:39.2290827Z 2025-05-07T19:53:39.2292186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2294359Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2295302Z ^ 2025-05-07T19:53:39.2295521Z 2025-05-07T19:53:39.2295879Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:39.2296422Z 2025-05-07T19:53:39.2297793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.2300157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:39.2301119Z ^ 2025-05-07T19:53:39.2301401Z 2025-05-07T19:53:41.5108586Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:41.5132303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5135047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5136206Z ^ 2025-05-07T19:53:41.5136477Z 2025-05-07T19:53:41.5136946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:41.5137627Z 2025-05-07T19:53:41.5139332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5142032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5143104Z ^ 2025-05-07T19:53:41.5143436Z 2025-05-07T19:53:41.5144657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5147510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5148531Z ^ 
2025-05-07T19:53:41.5148748Z 2025-05-07T19:53:41.5149154Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:41.5149695Z 2025-05-07T19:53:41.5150934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5153229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5154333Z ^ 2025-05-07T19:53:41.5154704Z 2025-05-07T19:53:41.5156234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5158538Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5159592Z ^ 2025-05-07T19:53:41.5159814Z 2025-05-07T19:53:41.5160209Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:41.5160805Z 2025-05-07T19:53:41.5162421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5164754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5165840Z ^ 2025-05-07T19:53:41.5166147Z 2025-05-07T19:53:41.5167892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:53:41.5170539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5171954Z ^ 2025-05-07T19:53:41.5172175Z 2025-05-07T19:53:41.5172532Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:41.5173159Z 2025-05-07T19:53:41.5174565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5177185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5178202Z ^ 2025-05-07T19:53:41.5178518Z 2025-05-07T19:53:41.5179867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5182266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5183167Z ^ 2025-05-07T19:53:41.5183379Z 2025-05-07T19:53:41.5183735Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:41.5184264Z 2025-05-07T19:53:41.5185743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:41.5188314Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:41.5189462Z ^ 2025-05-07T19:53:41.5189826Z 2025-05-07T19:53:54.5403180Z [149/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:53:54.5424008Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:55.0023737Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:53:55.0044482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0047037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0048145Z ^ 2025-05-07T19:53:55.0048379Z 2025-05-07T19:53:55.0048798Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.0049394Z 2025-05-07T19:53:55.0050918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0053543Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0054561Z ^ 2025-05-07T19:53:55.0055323Z 2025-05-07T19:53:55.0056712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0059087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0060408Z ^ 2025-05-07T19:53:55.0060645Z 2025-05-07T19:53:55.0061005Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.0061583Z 2025-05-07T19:53:55.0063125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0065420Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0066555Z ^ 2025-05-07T19:53:55.0066898Z 2025-05-07T19:53:55.0068433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0070808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0071846Z ^ 2025-05-07T19:53:55.0072068Z 2025-05-07T19:53:55.0072425Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.0073243Z 2025-05-07T19:53:55.0074694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0077240Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0078325Z ^ 2025-05-07T19:53:55.0078659Z 2025-05-07T19:53:55.0080140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0082519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0083574Z ^ 2025-05-07T19:53:55.0083845Z 2025-05-07T19:53:55.0084275Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:53:55.0084902Z 2025-05-07T19:53:55.0086352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0088796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0089771Z ^ 2025-05-07T19:53:55.0090114Z 2025-05-07T19:53:55.0091573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0093932Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0094940Z ^ 2025-05-07T19:53:55.0095185Z 2025-05-07T19:53:55.0095585Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:53:55.0096619Z 2025-05-07T19:53:55.0098071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:55.0100561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:53:55.0101635Z ^ 2025-05-07T19:53:55.0101949Z 2025-05-07T19:54:03.0082158Z [151/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:03.0102051Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:06.2702475Z [152/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:54:06.2721347Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:07.3904631Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 
2025-05-07T19:54:07.3928046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3930801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3932244Z ^ 2025-05-07T19:54:07.3932518Z 2025-05-07T19:54:07.3932971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:07.3933651Z 2025-05-07T19:54:07.3935243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3937637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3938795Z ^ 2025-05-07T19:54:07.3939113Z 2025-05-07T19:54:07.3940726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3943043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3944167Z ^ 2025-05-07T19:54:07.3944400Z 2025-05-07T19:54:07.3944833Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:07.3945423Z 2025-05-07T19:54:07.3946877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3951276Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3952573Z ^ 2025-05-07T19:54:07.3952956Z 2025-05-07T19:54:07.3954446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3957184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3958327Z ^ 2025-05-07T19:54:07.3958573Z 2025-05-07T19:54:07.3959020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:07.3959660Z 2025-05-07T19:54:07.3961265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3963862Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3964903Z ^ 2025-05-07T19:54:07.3965250Z 2025-05-07T19:54:07.3966859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3969599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3970782Z ^ 2025-05-07T19:54:07.3971025Z 2025-05-07T19:54:07.3971476Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:07.3972184Z 2025-05-07T19:54:07.3973880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3977230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3978364Z ^ 2025-05-07T19:54:07.3978704Z 2025-05-07T19:54:07.3980539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3983054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3984179Z ^ 2025-05-07T19:54:07.3984438Z 2025-05-07T19:54:07.3984883Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:07.3985516Z 2025-05-07T19:54:07.3986934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.3989478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:07.3990612Z ^ 2025-05-07T19:54:07.3990949Z 2025-05-07T19:54:08.4106090Z [154/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:08.4125493Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:11.1299311Z [155/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:54:11.1319597Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:17.3753137Z [156/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:54:17.3772057Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:19.1916887Z [157/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:54:19.1936560Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:21.0154731Z [158/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:21.0175197Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:22.6908656Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:22.6931395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6933879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6935346Z ^ 2025-05-07T19:54:22.6935620Z 2025-05-07T19:54:22.6936041Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6936684Z 2025-05-07T19:54:22.6938345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6940906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6941823Z ^ 2025-05-07T19:54:22.6942141Z 2025-05-07T19:54:22.6943615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6946229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6947355Z ^ 2025-05-07T19:54:22.6947643Z 2025-05-07T19:54:22.6948102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6948745Z 2025-05-07T19:54:22.6950476Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6953485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6954836Z ^ 2025-05-07T19:54:22.6955200Z 2025-05-07T19:54:22.6956976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6959558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6960704Z ^ 2025-05-07T19:54:22.6960957Z 2025-05-07T19:54:22.6961388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6962050Z 2025-05-07T19:54:22.6963872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6966523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6967655Z ^ 2025-05-07T19:54:22.6968035Z 2025-05-07T19:54:22.6969618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6972051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6973183Z ^ 
2025-05-07T19:54:22.6973451Z 2025-05-07T19:54:22.6973896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6974560Z 2025-05-07T19:54:22.6976617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6979148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6980795Z ^ 2025-05-07T19:54:22.6981181Z 2025-05-07T19:54:22.6982782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6985039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6986229Z ^ 2025-05-07T19:54:22.6986465Z 2025-05-07T19:54:22.6986888Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:22.6987472Z 2025-05-07T19:54:22.6988984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:22.6991532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:22.6992645Z ^ 2025-05-07T19:54:22.6993028Z 2025-05-07T19:54:23.8586496Z [160/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:23.8607261Z clang-16: 
warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.5481466Z [161/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:54:27.5502361Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:27.6732312Z [162/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:54:27.6752388Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:28.1284447Z [163/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:28.1308648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1311461Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1312713Z ^ 2025-05-07T19:54:28.1312982Z 2025-05-07T19:54:28.1313474Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.1314175Z 2025-05-07T19:54:28.1315909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1318693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1319917Z ^ 2025-05-07T19:54:28.1320766Z 2025-05-07T19:54:28.1322268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1325064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1326267Z ^ 2025-05-07T19:54:28.1326566Z 2025-05-07T19:54:28.1327032Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.1327731Z 2025-05-07T19:54:28.1329600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1332279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1333515Z ^ 2025-05-07T19:54:28.1333879Z 2025-05-07T19:54:28.1335527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1338197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1339379Z ^ 2025-05-07T19:54:28.1339646Z 2025-05-07T19:54:28.1340385Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.1341340Z 2025-05-07T19:54:28.1343013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1345857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1347077Z ^ 2025-05-07T19:54:28.1347498Z 2025-05-07T19:54:28.1349188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1351950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1353179Z ^ 2025-05-07T19:54:28.1353482Z 2025-05-07T19:54:28.1353953Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.1354648Z 2025-05-07T19:54:28.1356283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1359148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:54:28.1360357Z ^ 2025-05-07T19:54:28.1360733Z 2025-05-07T19:54:28.1362393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1365093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1366289Z ^ 2025-05-07T19:54:28.1366553Z 2025-05-07T19:54:28.1367006Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:28.1367937Z 2025-05-07T19:54:28.1369756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.1372571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:28.1373760Z ^ 2025-05-07T19:54:28.1374154Z 2025-05-07T19:54:29.9607106Z [164/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:29.9627094Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:29.9919762Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:29.9941101Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.0242278Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.0262730Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.0541295Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.0562382Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.0880593Z [168/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.0905139Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.1182688Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.1203687Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.1485254Z [170/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.1505419Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.1801022Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.1822739Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.2104542Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2122088Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.2407735Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2430143Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.2705339Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.2726034Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.3007389Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3029740Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.3309151Z [176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:30.3330565Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:30.3612088Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 
2025-05-07T19:54:30.3633500Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:31.6698381Z [178/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:31.6719299Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:33.5800519Z [179/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:54:33.5821409Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:34.1920753Z [180/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:34.1945137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.1948362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.1949559Z ^ 2025-05-07T19:54:34.1949836Z 2025-05-07T19:54:34.1950277Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:54:34.1950954Z 2025-05-07T19:54:34.1952617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.1955289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.1956503Z ^ 2025-05-07T19:54:34.1956873Z 2025-05-07T19:54:34.1958122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.1959928Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.1960766Z ^ 2025-05-07T19:54:34.1964156Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:34.1967268Z 2025-05-07T19:54:34.1968543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign
2025-05-07T19:54:34.1970462Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.1971346Z ^ 2025-05-07T19:54:34.1974917Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:34.1978421Z 2025-05-07T19:54:34.1979715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.1981790Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.1982730Z ^ 2025-05-07T19:54:34.1986296Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:34.1989511Z 2025-05-07T19:54:34.1991239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.1993197Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.1994050Z ^ 2025-05-07T19:54:34.1997525Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:34.2000766Z 2025-05-07T19:54:34.2002116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2004306Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2005262Z ^ 2025-05-07T19:54:34.2008844Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:34.2012239Z 2025-05-07T19:54:34.2013629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2015702Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2016698Z ^ 2025-05-07T19:54:34.2020469Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:34.2024171Z 2025-05-07T19:54:34.2025339Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2027308Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2028229Z ^ 2025-05-07T19:54:34.2031632Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:34.2034783Z 2025-05-07T19:54:34.2036008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2038137Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2039032Z ^ 2025-05-07T19:54:34.2042310Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:34.2045297Z 2025-05-07T19:54:34.2046508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2048279Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2049171Z ^ 
2025-05-07T19:54:34.2052627Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:34.2056022Z 2025-05-07T19:54:34.2057360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2059455Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2060558Z ^ 2025-05-07T19:54:34.2064138Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:34.2067761Z 2025-05-07T19:54:34.2069067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2071028Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2071993Z ^ 2025-05-07T19:54:34.2075209Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, 
const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:34.2078460Z 2025-05-07T19:54:34.2079620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2081356Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2082548Z ^ 2025-05-07T19:54:34.2085928Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:34.2089174Z 2025-05-07T19:54:34.2090481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2092304Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2093187Z ^ 2025-05-07T19:54:34.2096525Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:34.2099671Z 2025-05-07T19:54:34.2101180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): 
warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2103346Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2104298Z ^ 2025-05-07T19:54:34.2107830Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:34.2111412Z 2025-05-07T19:54:34.2112669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2114662Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2115588Z ^ 2025-05-07T19:54:34.2118974Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:34.2122071Z 2025-05-07T19:54:34.2123399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2125423Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2126356Z ^ 2025-05-07T19:54:34.2129985Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const 
emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:34.2133343Z 2025-05-07T19:54:34.2134612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2136637Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2137558Z ^ 2025-05-07T19:54:34.2141250Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:34.2144787Z 2025-05-07T19:54:34.2146065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2148152Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2149093Z ^ 2025-05-07T19:54:34.2152899Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at 
line 1839 2025-05-07T19:54:34.2156445Z 2025-05-07T19:54:34.2157766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2159767Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2160631Z ^ 2025-05-07T19:54:34.2163888Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:34.2166922Z 2025-05-07T19:54:34.2168107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2169935Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2170804Z ^ 2025-05-07T19:54:34.2174270Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:34.2177499Z 2025-05-07T19:54:34.2178701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2180615Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2181478Z ^ 2025-05-07T19:54:34.2184670Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:34.2188084Z 2025-05-07T19:54:34.2189460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2191528Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2192512Z ^ 2025-05-07T19:54:34.2195953Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:34.2199385Z 2025-05-07T19:54:34.2200608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2202562Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2203491Z ^ 2025-05-07T19:54:34.2206796Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:34.2209843Z 2025-05-07T19:54:34.2211087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2213004Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2213964Z ^ 2025-05-07T19:54:34.2220397Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:34.2223866Z 2025-05-07T19:54:34.2225506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.2228262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.2229437Z ^ 2025-05-07T19:54:34.2229701Z 2025-05-07T19:54:34.2230148Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:34.2230808Z 2025-05-07T19:54:34.2232510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:34.2235266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.2236474Z ^ 2025-05-07T19:54:34.2236871Z 2025-05-07T19:54:34.2238163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2240166Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2241075Z ^ 2025-05-07T19:54:34.2244548Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:34.2247955Z 2025-05-07T19:54:34.2249249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2251132Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2252031Z ^ 2025-05-07T19:54:34.2255704Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:34.2259176Z 2025-05-07T19:54:34.2260716Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2262810Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2263883Z ^ 2025-05-07T19:54:34.2267705Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:34.2271025Z 2025-05-07T19:54:34.2272466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2274543Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2275503Z ^ 2025-05-07T19:54:34.2279461Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:34.2282556Z 2025-05-07T19:54:34.2283931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2285750Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2286715Z ^ 
2025-05-07T19:54:34.2290317Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:34.2293949Z 2025-05-07T19:54:34.2295129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2296769Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2297582Z ^ 2025-05-07T19:54:34.2301060Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:34.2304267Z 2025-05-07T19:54:34.2305496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2307657Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2308579Z ^ 2025-05-07T19:54:34.2312609Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:34.2315859Z 2025-05-07T19:54:34.2317158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2318993Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2319897Z ^ 2025-05-07T19:54:34.2323393Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:34.2326887Z 2025-05-07T19:54:34.2328212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2330284Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2331235Z ^ 2025-05-07T19:54:34.2334843Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:34.2338383Z 2025-05-07T19:54:34.2339697Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2341866Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2342803Z ^ 2025-05-07T19:54:34.2346341Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:34.2349503Z 2025-05-07T19:54:34.2350717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2352635Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2353556Z ^ 2025-05-07T19:54:34.2357278Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:34.2360547Z 2025-05-07T19:54:34.2361840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2364009Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2364989Z ^ 
2025-05-07T19:54:34.2368711Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:34.2372127Z 2025-05-07T19:54:34.2373378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2375583Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2376811Z ^ 2025-05-07T19:54:34.2380448Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:34.2384034Z 2025-05-07T19:54:34.2385342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2387284Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2388137Z ^ 2025-05-07T19:54:34.2391709Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:34.2395021Z 2025-05-07T19:54:34.2396372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2398432Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2399369Z ^ 2025-05-07T19:54:34.2403319Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:34.2406736Z 2025-05-07T19:54:34.2408064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2410149Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2411076Z ^ 2025-05-07T19:54:34.2414751Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:34.2418187Z 2025-05-07T19:54:34.2419519Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2421777Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2422710Z ^ 2025-05-07T19:54:34.2426403Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:34.2429476Z 2025-05-07T19:54:34.2430697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2432735Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2433652Z ^ 2025-05-07T19:54:34.2437509Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:34.2441021Z 2025-05-07T19:54:34.2442497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2444540Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2445488Z ^ 
2025-05-07T19:54:34.2448941Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:34.2451743Z 2025-05-07T19:54:34.2452801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2454672Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2455574Z ^ 2025-05-07T19:54:34.2459140Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:34.2462552Z 2025-05-07T19:54:34.2463809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2465761Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2466673Z ^ 2025-05-07T19:54:34.2470168Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:34.2473514Z 2025-05-07T19:54:34.2474870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2477411Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2478357Z ^ 2025-05-07T19:54:34.2481906Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:34.2485256Z 2025-05-07T19:54:34.2486616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2488705Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2489635Z ^ 2025-05-07T19:54:34.2507737Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:34.2510951Z 2025-05-07T19:54:34.2512197Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2513827Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2514703Z ^ 2025-05-07T19:54:34.2518092Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:34.2521426Z 2025-05-07T19:54:34.2522991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.2525684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.2526882Z ^ 2025-05-07T19:54:34.2527143Z 2025-05-07T19:54:34.2527618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:34.2528307Z 2025-05-07T19:54:34.2530014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.2532799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.2533988Z ^ 2025-05-07T19:54:34.2534349Z 2025-05-07T19:54:34.2535915Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2537811Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2538681Z ^ 2025-05-07T19:54:34.2542228Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:34.2545615Z 2025-05-07T19:54:34.2546934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2549007Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2550063Z ^ 2025-05-07T19:54:34.2553962Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:34.2557105Z 2025-05-07T19:54:34.2558336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2560223Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2561203Z ^ 
2025-05-07T19:54:34.2564945Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:34.2568510Z 2025-05-07T19:54:34.2569841Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2571851Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2572794Z ^ 2025-05-07T19:54:34.2576329Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:34.2579662Z 2025-05-07T19:54:34.2581079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2583404Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2584344Z ^ 2025-05-07T19:54:34.2587365Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:34.2590195Z 2025-05-07T19:54:34.2591417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2593392Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2594299Z ^ 2025-05-07T19:54:34.2598052Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:34.2601131Z 2025-05-07T19:54:34.2602479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2604398Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2605264Z ^ 2025-05-07T19:54:34.2608350Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:34.2611407Z 2025-05-07T19:54:34.2612755Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2614966Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2615889Z ^ 2025-05-07T19:54:34.2619541Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:34.2623066Z 2025-05-07T19:54:34.2624387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2626642Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2627492Z ^ 2025-05-07T19:54:34.2630799Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:34.2633873Z 2025-05-07T19:54:34.2635113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2637119Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2638020Z ^ 
2025-05-07T19:54:34.2641533Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:34.2645061Z 2025-05-07T19:54:34.2646336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2648441Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2649356Z ^ 2025-05-07T19:54:34.2652953Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:34.2656327Z 2025-05-07T19:54:34.2657669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2659685Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2660741Z ^ 2025-05-07T19:54:34.2664455Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, 
const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:34.2667768Z 2025-05-07T19:54:34.2669069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2671004Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2672080Z ^ 2025-05-07T19:54:34.2675601Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:34.2679002Z 2025-05-07T19:54:34.2680267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2682065Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2682802Z ^ 2025-05-07T19:54:34.2685405Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:34.2687806Z 2025-05-07T19:54:34.2689092Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2690647Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2691410Z ^ 2025-05-07T19:54:34.2694469Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:34.2697261Z 2025-05-07T19:54:34.2698530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2700446Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2701306Z ^ 2025-05-07T19:54:34.2704538Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:34.2707941Z 2025-05-07T19:54:34.2709271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2711284Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2712509Z ^ 
2025-05-07T19:54:34.2716141Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:34.2719528Z 2025-05-07T19:54:34.2720844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2722853Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2723795Z ^ 2025-05-07T19:54:34.2727462Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:34.2730871Z 2025-05-07T19:54:34.2732370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2734406Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2735488Z ^ 2025-05-07T19:54:34.2739250Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t 
*, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:34.2742857Z 2025-05-07T19:54:34.2744350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2746378Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2747319Z ^ 2025-05-07T19:54:34.2750998Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:34.2754274Z 2025-05-07T19:54:34.2755578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2757489Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2758403Z ^ 2025-05-07T19:54:34.2762139Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:34.2765483Z 2025-05-07T19:54:34.2766796Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2768794Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2769726Z ^ 2025-05-07T19:54:34.2773359Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:34.2777078Z 2025-05-07T19:54:34.2778662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2780646Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2781496Z ^ 2025-05-07T19:54:34.2784772Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:34.2787657Z 2025-05-07T19:54:34.2788730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2790751Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2791697Z ^ 
2025-05-07T19:54:34.2795330Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:34.2798730Z 2025-05-07T19:54:34.2800462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.2803235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.2807404Z ^ 2025-05-07T19:54:34.2807663Z 2025-05-07T19:54:34.2808123Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:34.2808803Z 2025-05-07T19:54:34.2810507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.2813298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.2814537Z ^ 2025-05-07T19:54:34.2814919Z 2025-05-07T19:54:34.2816259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2818273Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2819229Z ^ 2025-05-07T19:54:34.2822985Z detected during instantiation of 
"void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:34.2826318Z 2025-05-07T19:54:34.2827836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2829795Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2830628Z ^ 2025-05-07T19:54:34.2834024Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:34.2837287Z 2025-05-07T19:54:34.2838589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2840568Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2841533Z ^ 2025-05-07T19:54:34.2845243Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:34.2848597Z 2025-05-07T19:54:34.2849961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2851985Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2853121Z ^ 2025-05-07T19:54:34.2856803Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:34.2859748Z 2025-05-07T19:54:34.2860894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2862431Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2863211Z ^ 2025-05-07T19:54:34.2866450Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:34.2869608Z 2025-05-07T19:54:34.2871117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of 
sign 2025-05-07T19:54:34.2873157Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2874075Z ^ 2025-05-07T19:54:34.2877744Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:34.2880697Z 2025-05-07T19:54:34.2881960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2883888Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2884732Z ^ 2025-05-07T19:54:34.2888438Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:34.2891893Z 2025-05-07T19:54:34.2893239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2895377Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2896312Z ^ 2025-05-07T19:54:34.2900589Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, 
uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:34.2903942Z 2025-05-07T19:54:34.2905208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2907235Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2908145Z ^ 2025-05-07T19:54:34.2911686Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:34.2914947Z 2025-05-07T19:54:34.2916246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2918488Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2919415Z ^ 2025-05-07T19:54:34.2922958Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:34.2926321Z 
2025-05-07T19:54:34.2927534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2929524Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2930340Z ^ 2025-05-07T19:54:34.2933393Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:34.2936723Z 2025-05-07T19:54:34.2938088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2940273Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2941202Z ^ 2025-05-07T19:54:34.2944912Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:34.2948346Z 2025-05-07T19:54:34.2949633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2951567Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 
2025-05-07T19:54:34.2952489Z ^ 2025-05-07T19:54:34.2955976Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:34.2959132Z 2025-05-07T19:54:34.2960186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2962245Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2963175Z ^ 2025-05-07T19:54:34.2966743Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:34.2970095Z 2025-05-07T19:54:34.2971403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2973418Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2974329Z ^ 2025-05-07T19:54:34.2978148Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const 
index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:34.2981806Z 2025-05-07T19:54:34.2983146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2985309Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2986223Z ^ 2025-05-07T19:54:34.2989837Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:34.2993491Z 2025-05-07T19:54:34.2994775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.2996776Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.2997673Z ^ 2025-05-07T19:54:34.3001211Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:34.3004641Z 2025-05-07T19:54:34.3005919Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3007707Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3008683Z ^ 2025-05-07T19:54:34.3012138Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:34.3015387Z 2025-05-07T19:54:34.3016465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3018414Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3019366Z ^ 2025-05-07T19:54:34.3023045Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:34.3026403Z 2025-05-07T19:54:34.3027776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3029853Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3030827Z ^ 
2025-05-07T19:54:34.3033879Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:34.3036564Z 2025-05-07T19:54:34.3037666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3039185Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3039951Z ^ 2025-05-07T19:54:34.3042742Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:34.3045043Z 2025-05-07T19:54:34.3045957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3047632Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3048419Z ^ 2025-05-07T19:54:34.3051570Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:34.3054357Z 2025-05-07T19:54:34.3055435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3057143Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3057940Z ^ 2025-05-07T19:54:34.3060985Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:34.3063858Z 2025-05-07T19:54:34.3065166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3067113Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3068017Z ^ 2025-05-07T19:54:34.3071493Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:34.3074893Z 2025-05-07T19:54:34.3076830Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.3079484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.3080604Z ^ 2025-05-07T19:54:34.3080838Z 2025-05-07T19:54:34.3081244Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:34.3081861Z 2025-05-07T19:54:34.3083628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:34.3086451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:34.3087499Z ^ 2025-05-07T19:54:34.3087887Z 2025-05-07T19:54:34.3089212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3090900Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3092040Z ^ 2025-05-07T19:54:34.3095448Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 955 2025-05-07T19:54:34.3098698Z 2025-05-07T19:54:34.3100008Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3102103Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3103030Z ^ 2025-05-07T19:54:34.3106502Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1007 2025-05-07T19:54:34.3109800Z 2025-05-07T19:54:34.3111097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3113093Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3114007Z ^ 2025-05-07T19:54:34.3117528Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1059 2025-05-07T19:54:34.3121020Z 2025-05-07T19:54:34.3122305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3124410Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3125366Z ^ 
2025-05-07T19:54:34.3128954Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1111 2025-05-07T19:54:34.3132397Z 2025-05-07T19:54:34.3133742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3135759Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3136905Z ^ 2025-05-07T19:54:34.3140582Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1163 2025-05-07T19:54:34.3143839Z 2025-05-07T19:54:34.3145089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3147059Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3147951Z ^ 2025-05-07T19:54:34.3151471Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const 
int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1215 2025-05-07T19:54:34.3154728Z 2025-05-07T19:54:34.3156038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3158039Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3158976Z ^ 2025-05-07T19:54:34.3162338Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1267 2025-05-07T19:54:34.3165792Z 2025-05-07T19:54:34.3167141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3169294Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3170266Z ^ 2025-05-07T19:54:34.3173878Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1319 2025-05-07T19:54:34.3177380Z 2025-05-07T19:54:34.3178677Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3180804Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3181776Z ^ 2025-05-07T19:54:34.3185927Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1371 2025-05-07T19:54:34.3189294Z 2025-05-07T19:54:34.3190528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3192524Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3193414Z ^ 2025-05-07T19:54:34.3196909Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1423 2025-05-07T19:54:34.3200014Z 2025-05-07T19:54:34.3201306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3203238Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3204155Z ^ 
2025-05-07T19:54:34.3207529Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1475 2025-05-07T19:54:34.3211039Z 2025-05-07T19:54:34.3212324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3214320Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3215236Z ^ 2025-05-07T19:54:34.3218883Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1527 2025-05-07T19:54:34.3222163Z 2025-05-07T19:54:34.3223367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3225402Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3226399Z ^ 2025-05-07T19:54:34.3230172Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t 
*, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1579 2025-05-07T19:54:34.3233401Z 2025-05-07T19:54:34.3234686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3236635Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3237542Z ^ 2025-05-07T19:54:34.3241045Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1631 2025-05-07T19:54:34.3244280Z 2025-05-07T19:54:34.3245526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3247573Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3248524Z ^ 2025-05-07T19:54:34.3252235Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1683 2025-05-07T19:54:34.3255842Z 2025-05-07T19:54:34.3257161Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3259209Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3260337Z ^ 2025-05-07T19:54:34.3263875Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1735 2025-05-07T19:54:34.3267134Z 2025-05-07T19:54:34.3268405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3270285Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3271157Z ^ 2025-05-07T19:54:34.3274792Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1787 2025-05-07T19:54:34.3278138Z 2025-05-07T19:54:34.3279292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3281087Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3281953Z ^ 
2025-05-07T19:54:34.3285368Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1839 2025-05-07T19:54:34.3288664Z 2025-05-07T19:54:34.3290012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3292015Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3292940Z ^ 2025-05-07T19:54:34.3296463Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1891 2025-05-07T19:54:34.3300074Z 2025-05-07T19:54:34.3301528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3303511Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3304433Z ^ 2025-05-07T19:54:34.3308208Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const 
uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1943 2025-05-07T19:54:34.3311617Z 2025-05-07T19:54:34.3312979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3314988Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3315937Z ^ 2025-05-07T19:54:34.3320005Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1995 2025-05-07T19:54:34.3323419Z 2025-05-07T19:54:34.3324713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3326778Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3327613Z ^ 2025-05-07T19:54:34.3331089Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2047 2025-05-07T19:54:34.3334326Z 2025-05-07T19:54:34.3335550Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3337456Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3338336Z ^ 2025-05-07T19:54:34.3341871Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2099 2025-05-07T19:54:34.3345217Z 2025-05-07T19:54:34.3346445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:34.3348430Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:54:34.3349385Z ^ 2025-05-07T19:54:34.3353100Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2151 2025-05-07T19:54:34.3356529Z 2025-05-07T19:54:35.2801766Z [181/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:54:35.2822214Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:35.6739387Z [182/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:35.6762721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6765573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6766671Z ^ 2025-05-07T19:54:35.6766920Z 2025-05-07T19:54:35.6767326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:35.6768013Z 2025-05-07T19:54:35.6769730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6772324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6773363Z ^ 2025-05-07T19:54:35.6773696Z 2025-05-07T19:54:35.6775256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6778270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6779335Z ^ 2025-05-07T19:54:35.6779574Z 2025-05-07T19:54:35.6779992Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:35.6780797Z 2025-05-07T19:54:35.6782502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6785022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6786672Z ^ 2025-05-07T19:54:35.6787051Z 2025-05-07T19:54:35.6788552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6790895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6791942Z ^ 2025-05-07T19:54:35.6792172Z 2025-05-07T19:54:35.6792599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:35.6793193Z 2025-05-07T19:54:35.6794657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6797024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6798056Z ^ 
2025-05-07T19:54:35.6798372Z 2025-05-07T19:54:35.6799927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6802866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6804120Z ^ 2025-05-07T19:54:35.6804348Z 2025-05-07T19:54:35.6804777Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:35.6805411Z 2025-05-07T19:54:35.6806934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6809470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6810533Z ^ 2025-05-07T19:54:35.6810832Z 2025-05-07T19:54:35.6812208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:35.6814468Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6815486Z ^ 2025-05-07T19:54:35.6815735Z 2025-05-07T19:54:35.6816164Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:35.6816814Z 2025-05-07T19:54:35.6818473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:35.6821358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:35.6822479Z ^ 2025-05-07T19:54:35.6822862Z 2025-05-07T19:54:36.1818837Z [183/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:36.1841564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1844043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1845196Z ^ 2025-05-07T19:54:36.1845451Z 2025-05-07T19:54:36.1845882Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1846539Z 2025-05-07T19:54:36.1848105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1850754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() 
= default; 2025-05-07T19:54:36.1852043Z ^ 2025-05-07T19:54:36.1852414Z 2025-05-07T19:54:36.1854112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1856845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1858087Z ^ 2025-05-07T19:54:36.1858341Z 2025-05-07T19:54:36.1858820Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1859494Z 2025-05-07T19:54:36.1861352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1864388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1865586Z ^ 2025-05-07T19:54:36.1865995Z 2025-05-07T19:54:36.1867674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1870416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1871726Z ^ 2025-05-07T19:54:36.1871993Z 2025-05-07T19:54:36.1872439Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1873088Z 2025-05-07T19:54:36.1874770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:54:36.1877671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1878859Z ^ 2025-05-07T19:54:36.1879226Z 2025-05-07T19:54:36.1883011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1885727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1886920Z ^ 2025-05-07T19:54:36.1887178Z 2025-05-07T19:54:36.1887632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1888283Z 2025-05-07T19:54:36.1889935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1892618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1893788Z ^ 2025-05-07T19:54:36.1894188Z 2025-05-07T19:54:36.1895825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1898488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1899805Z ^ 2025-05-07T19:54:36.1900073Z 2025-05-07T19:54:36.1900641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:36.1901302Z 2025-05-07T19:54:36.1902980Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:36.1905711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:36.1906940Z ^ 2025-05-07T19:54:36.1907316Z 2025-05-07T19:54:41.1367689Z [184/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:41.1386308Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:41.8492426Z [185/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:41.8512374Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:41.8806673Z [186/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:41.8826081Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.7640551Z [187/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 
2025-05-07T19:54:42.7660061Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.7958732Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.7976394Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.8271442Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.8290125Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.8578257Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.8589415Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.8899888Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.8911284Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.9218359Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.9229514Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:42.9547641Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:42.9565557Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:47.4984262Z [194/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:47.5002649Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:49.9209686Z [195/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:49.9234490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9237223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9238410Z ^ 2025-05-07T19:54:49.9238661Z 2025-05-07T19:54:49.9239146Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.9239857Z 2025-05-07T19:54:49.9241839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9244593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9245785Z ^ 2025-05-07T19:54:49.9246334Z 2025-05-07T19:54:49.9248018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9250065Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:54:49.9250638Z ^ 2025-05-07T19:54:49.9250956Z 2025-05-07T19:54:49.9252592Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9254643Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9255181Z ^ 2025-05-07T19:54:49.9255501Z 2025-05-07T19:54:49.9257225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9259265Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9259806Z ^ 2025-05-07T19:54:49.9260222Z 2025-05-07T19:54:49.9261923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9263839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9264724Z ^ 2025-05-07T19:54:49.9264915Z 2025-05-07T19:54:49.9265272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.9265762Z 2025-05-07T19:54:49.9267177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9269976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9271195Z ^ 2025-05-07T19:54:49.9271564Z 2025-05-07T19:54:49.9273233Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9275136Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9276201Z ^ 2025-05-07T19:54:49.9276528Z 2025-05-07T19:54:49.9278168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9279971Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9280517Z ^ 2025-05-07T19:54:49.9280827Z 2025-05-07T19:54:49.9282453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9284313Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9284804Z ^ 2025-05-07T19:54:49.9285089Z 2025-05-07T19:54:49.9287089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9289517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9290643Z ^ 2025-05-07T19:54:49.9290900Z 2025-05-07T19:54:49.9291348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.9292005Z 2025-05-07T19:54:49.9293695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:49.9295893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9296910Z ^ 2025-05-07T19:54:49.9297276Z 2025-05-07T19:54:49.9298760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9300628Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9301075Z ^ 2025-05-07T19:54:49.9301337Z 2025-05-07T19:54:49.9302611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9304470Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9304983Z ^ 2025-05-07T19:54:49.9305268Z 2025-05-07T19:54:49.9306862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9308873Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9309864Z ^ 2025-05-07T19:54:49.9310173Z 2025-05-07T19:54:49.9311904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9314332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9315709Z ^ 2025-05-07T19:54:49.9315941Z 2025-05-07T19:54:49.9316377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.9317056Z 
2025-05-07T19:54:49.9318605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9321281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9322417Z ^ 2025-05-07T19:54:49.9322794Z 2025-05-07T19:54:49.9324371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9326418Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9326962Z ^ 2025-05-07T19:54:49.9327265Z 2025-05-07T19:54:49.9328982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9330908Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9331446Z ^ 2025-05-07T19:54:49.9331700Z 2025-05-07T19:54:49.9332866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9334926Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9335373Z ^ 2025-05-07T19:54:49.9335612Z 2025-05-07T19:54:49.9336993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9339086Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9340291Z ^ 2025-05-07T19:54:49.9340531Z 2025-05-07T19:54:49.9340919Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:49.9341447Z 2025-05-07T19:54:49.9342721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.9345181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:49.9346402Z ^ 2025-05-07T19:54:49.9346774Z 2025-05-07T19:54:49.9348383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9350381Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9351203Z ^ 2025-05-07T19:54:49.9351549Z 2025-05-07T19:54:49.9353035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9354832Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9355370Z ^ 2025-05-07T19:54:49.9355722Z 2025-05-07T19:54:49.9357032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:49.9358699Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:54:49.9359216Z ^ 2025-05-07T19:54:49.9359489Z 2025-05-07T19:54:50.3639831Z [196/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:50.3661776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3664411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3665594Z ^ 2025-05-07T19:54:50.3665839Z 2025-05-07T19:54:50.3666743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:50.3667568Z 2025-05-07T19:54:50.3669130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3671763Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3672891Z ^ 2025-05-07T19:54:50.3673274Z 2025-05-07T19:54:50.3674840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3677997Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3679093Z ^ 2025-05-07T19:54:50.3679364Z 2025-05-07T19:54:50.3679784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:50.3680404Z 2025-05-07T19:54:50.3681940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3684478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3685573Z ^ 2025-05-07T19:54:50.3685915Z 2025-05-07T19:54:50.3687852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3690334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3691405Z ^ 2025-05-07T19:54:50.3691634Z 2025-05-07T19:54:50.3692030Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:50.3692650Z 2025-05-07T19:54:50.3694199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3696941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3698032Z ^ 2025-05-07T19:54:50.3698382Z 2025-05-07T19:54:50.3699891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3702475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3703558Z ^ 2025-05-07T19:54:50.3703797Z 2025-05-07T19:54:50.3704227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:50.3704831Z 2025-05-07T19:54:50.3706356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3709147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3710249Z ^ 2025-05-07T19:54:50.3710936Z 2025-05-07T19:54:50.3712241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3714264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3715553Z ^ 2025-05-07T19:54:50.3715797Z 2025-05-07T19:54:50.3716290Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:50.3717016Z 2025-05-07T19:54:50.3718930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.3721086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:50.3722011Z ^ 
2025-05-07T19:54:50.3722293Z 2025-05-07T19:54:51.3160925Z [197/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:51.3182020Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:51.4740601Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:51.4753201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4754637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4755269Z ^ 2025-05-07T19:54:51.4755437Z 2025-05-07T19:54:51.4755687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.4756047Z 2025-05-07T19:54:51.4756947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4758365Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4759012Z ^ 2025-05-07T19:54:51.4759217Z 2025-05-07T19:54:51.4760101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4761498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4762130Z ^ 2025-05-07T19:54:51.4762271Z 2025-05-07T19:54:51.4762520Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.4762891Z 2025-05-07T19:54:51.4763775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4765317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4765962Z ^ 2025-05-07T19:54:51.4766185Z 2025-05-07T19:54:51.4767057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4768482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4769109Z ^ 2025-05-07T19:54:51.4769264Z 2025-05-07T19:54:51.4769507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.4769868Z 2025-05-07T19:54:51.4770756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4772163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4772809Z ^ 2025-05-07T19:54:51.4773018Z 2025-05-07T19:54:51.4775098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4776858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4777540Z ^ 2025-05-07T19:54:51.4777696Z 2025-05-07T19:54:51.4777952Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.4778343Z 2025-05-07T19:54:51.4779232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4780851Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4781514Z ^ 2025-05-07T19:54:51.4781758Z 2025-05-07T19:54:51.4782647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4784106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4784758Z ^ 2025-05-07T19:54:51.4784909Z 2025-05-07T19:54:51.4785177Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:54:51.4785542Z 2025-05-07T19:54:51.4786430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.4787889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.4788580Z ^ 2025-05-07T19:54:51.4788791Z 2025-05-07T19:54:51.9815045Z [199/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:51.9838504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9841279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9842481Z ^ 2025-05-07T19:54:51.9842778Z 2025-05-07T19:54:51.9843235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.9843917Z 2025-05-07T19:54:51.9845671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:54:51.9848481Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9849686Z ^ 2025-05-07T19:54:51.9850059Z 2025-05-07T19:54:51.9851906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9854564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9855755Z ^ 2025-05-07T19:54:51.9856176Z 2025-05-07T19:54:51.9856651Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.9857328Z 2025-05-07T19:54:51.9859000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9861866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9863057Z ^ 2025-05-07T19:54:51.9863447Z 2025-05-07T19:54:51.9865096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9867744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9868914Z ^ 2025-05-07T19:54:51.9869199Z 2025-05-07T19:54:51.9869649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.9870316Z 2025-05-07T19:54:51.9872014Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9874659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9876421Z ^ 2025-05-07T19:54:51.9876794Z 2025-05-07T19:54:51.9878461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9881118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9882303Z ^ 2025-05-07T19:54:51.9882555Z 2025-05-07T19:54:51.9882994Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.9883675Z 2025-05-07T19:54:51.9885339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9888216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9889423Z ^ 2025-05-07T19:54:51.9889828Z 2025-05-07T19:54:51.9891506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9894256Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9895462Z ^ 
2025-05-07T19:54:51.9895744Z 2025-05-07T19:54:51.9896217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:51.9896898Z 2025-05-07T19:54:51.9898612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9901465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:51.9902932Z ^ 2025-05-07T19:54:51.9903311Z 2025-05-07T19:54:52.2035803Z [200/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:52.2053913Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.4461777Z [201/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:52.4483380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4486112Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4487394Z ^ 2025-05-07T19:54:52.4487619Z 2025-05-07T19:54:52.4487990Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:52.4488686Z 2025-05-07T19:54:52.4490658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4493063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4494160Z ^ 2025-05-07T19:54:52.4494479Z 2025-05-07T19:54:52.4495901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4498202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4499376Z ^ 2025-05-07T19:54:52.4499599Z 2025-05-07T19:54:52.4500027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:52.4500717Z 2025-05-07T19:54:52.4502076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4504510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4505606Z ^ 2025-05-07T19:54:52.4505973Z 2025-05-07T19:54:52.4507604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4510074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4511177Z ^ 2025-05-07T19:54:52.4511429Z 2025-05-07T19:54:52.4511847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:52.4512444Z 2025-05-07T19:54:52.4513954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4516949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4518065Z ^ 2025-05-07T19:54:52.4518404Z 2025-05-07T19:54:52.4519948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4522268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4523434Z ^ 2025-05-07T19:54:52.4523694Z 2025-05-07T19:54:52.4524171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:52.4525084Z 2025-05-07T19:54:52.4526886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4529470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:54:52.4530596Z ^ 2025-05-07T19:54:52.4530969Z 2025-05-07T19:54:52.4532608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4535219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4536116Z ^ 2025-05-07T19:54:52.4536345Z 2025-05-07T19:54:52.4536673Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:52.4537236Z 2025-05-07T19:54:52.4538645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.4541274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:52.4542321Z ^ 2025-05-07T19:54:52.4542647Z 2025-05-07T19:54:52.5797958Z [202/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:54:52.5808995Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:53.6021764Z [203/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:54:53.6040477Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:54.3286066Z [204/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:54:54.3307049Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.0437441Z [205/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:54:55.0457584Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:55.9617702Z [206/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:55.9641285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9643930Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9645059Z ^ 2025-05-07T19:54:55.9645307Z 2025-05-07T19:54:55.9645729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:55.9646381Z 2025-05-07T19:54:55.9647915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9650447Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9651525Z ^ 2025-05-07T19:54:55.9651916Z 2025-05-07T19:54:55.9653572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9656745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9657930Z ^ 2025-05-07T19:54:55.9658194Z 2025-05-07T19:54:55.9658664Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:55.9659344Z 2025-05-07T19:54:55.9661257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9663850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9665017Z ^ 2025-05-07T19:54:55.9665372Z 2025-05-07T19:54:55.9667024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9669707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9670878Z ^ 2025-05-07T19:54:55.9671121Z 2025-05-07T19:54:55.9671560Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:55.9672192Z 2025-05-07T19:54:55.9674130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9677113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9678327Z ^ 2025-05-07T19:54:55.9678716Z 2025-05-07T19:54:55.9680433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9683037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9684141Z ^ 2025-05-07T19:54:55.9684395Z 2025-05-07T19:54:55.9684881Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:55.9685566Z 2025-05-07T19:54:55.9687289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9690066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:54:55.9691270Z ^ 2025-05-07T19:54:55.9691654Z 2025-05-07T19:54:55.9693311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9695918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9697081Z ^ 2025-05-07T19:54:55.9697337Z 2025-05-07T19:54:55.9697796Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:54:55.9698485Z 2025-05-07T19:54:55.9700751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:55.9703509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:54:55.9704719Z ^ 2025-05-07T19:54:55.9705072Z 2025-05-07T19:54:57.1861711Z [207/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:54:57.1880891Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.0561981Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:00.0582768Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.2208201Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:01.2226730Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.7905780Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.7926932Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.1255361Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:05.1276185Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.2999938Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:05.3018571Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.5250092Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:55:05.5268894Z clang-16: 
warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.5522453Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o 
-MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:05.5542212Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.5800650Z [215/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:05.5821066Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.9707821Z [216/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:05.9727212Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:06.6328301Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:06.6347186Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:07.1976868Z [218/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:07.1993276Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:07.9441093Z [219/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:07.9461257Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:08.8617436Z [220/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:08.8635529Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.1524519Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:09.1539757Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.5054780Z [222/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:09.5077608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5080231Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5081332Z ^ 2025-05-07T19:55:09.5081594Z 2025-05-07T19:55:09.5082004Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.5082614Z 2025-05-07T19:55:09.5084146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5086667Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5087759Z ^ 2025-05-07T19:55:09.5088093Z 2025-05-07T19:55:09.5089625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5092129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5093382Z ^ 2025-05-07T19:55:09.5093599Z 2025-05-07T19:55:09.5094031Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.5094636Z 2025-05-07T19:55:09.5096149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5098759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5100161Z ^ 2025-05-07T19:55:09.5100891Z 2025-05-07T19:55:09.5102394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5104820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5105867Z ^ 2025-05-07T19:55:09.5106110Z 2025-05-07T19:55:09.5106507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.5107122Z 2025-05-07T19:55:09.5108639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5111063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5112306Z ^ 2025-05-07T19:55:09.5112654Z 2025-05-07T19:55:09.5114196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5116682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5117782Z ^ 2025-05-07T19:55:09.5118017Z 2025-05-07T19:55:09.5118734Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:55:09.5119344Z 2025-05-07T19:55:09.5120934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5123465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5124554Z ^ 2025-05-07T19:55:09.5124915Z 2025-05-07T19:55:09.5126469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5128984Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5130071Z ^ 2025-05-07T19:55:09.5130323Z 2025-05-07T19:55:09.5130743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:09.5131366Z 2025-05-07T19:55:09.5132949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:09.5135445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:09.5136583Z ^ 2025-05-07T19:55:09.5136935Z 2025-05-07T19:55:10.0035144Z [223/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:10.0056023Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T19:55:10.1991628Z [224/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:10.2011672Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:10.2589189Z [225/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:10.2609906Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:11.0048720Z [226/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T19:55:11.1503803Z [227/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:11.1522609Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:13.0631994Z [228/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:55:13.0656769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0659751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0661123Z ^ 2025-05-07T19:55:13.0661382Z 2025-05-07T19:55:13.0661842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.0662706Z 2025-05-07T19:55:13.0664355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0667065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0668302Z ^ 2025-05-07T19:55:13.0668690Z 2025-05-07T19:55:13.0670379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0672433Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0673020Z ^ 2025-05-07T19:55:13.0673326Z 2025-05-07T19:55:13.0675071Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0677322Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0678062Z ^ 2025-05-07T19:55:13.0678359Z 2025-05-07T19:55:13.0679939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0681957Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0682490Z ^ 2025-05-07T19:55:13.0683012Z 2025-05-07T19:55:13.0684808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0687550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0688749Z ^ 2025-05-07T19:55:13.0689024Z 2025-05-07T19:55:13.0689477Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.0690162Z 2025-05-07T19:55:13.0691897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0694779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0696102Z ^ 2025-05-07T19:55:13.0696472Z 2025-05-07T19:55:13.0698244Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0700238Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0700946Z ^ 2025-05-07T19:55:13.0701239Z 2025-05-07T19:55:13.0702840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0704812Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0705367Z ^ 2025-05-07T19:55:13.0705664Z 2025-05-07T19:55:13.0707274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0709435Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0709990Z ^ 2025-05-07T19:55:13.0710307Z 2025-05-07T19:55:13.0711950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0714803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0716009Z ^ 2025-05-07T19:55:13.0716281Z 2025-05-07T19:55:13.0716739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.0717590Z 2025-05-07T19:55:13.0719321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:13.0722040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0723252Z ^ 2025-05-07T19:55:13.0723630Z 2025-05-07T19:55:13.0725249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0727294Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0727883Z ^ 2025-05-07T19:55:13.0728193Z 2025-05-07T19:55:13.0729994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0732061Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0732646Z ^ 2025-05-07T19:55:13.0732946Z 2025-05-07T19:55:13.0734583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0736636Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0737276Z ^ 2025-05-07T19:55:13.0737593Z 2025-05-07T19:55:13.0739299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0742165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0743357Z ^ 2025-05-07T19:55:13.0743626Z 2025-05-07T19:55:13.0744079Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.0744765Z 
2025-05-07T19:55:13.0746445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0749214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0750449Z ^ 2025-05-07T19:55:13.0750806Z 2025-05-07T19:55:13.0752437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0754501Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0755183Z ^ 2025-05-07T19:55:13.0755486Z 2025-05-07T19:55:13.0757120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0759178Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0759742Z ^ 2025-05-07T19:55:13.0760062Z 2025-05-07T19:55:13.0761695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0763894Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0764443Z ^ 2025-05-07T19:55:13.0764756Z 2025-05-07T19:55:13.0766400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0769229Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0770393Z ^ 2025-05-07T19:55:13.0770662Z 2025-05-07T19:55:13.0771182Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:13.0771779Z 2025-05-07T19:55:13.0773582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:13.0776602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:13.0777818Z ^ 2025-05-07T19:55:13.0778192Z 2025-05-07T19:55:13.0779921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0782029Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0782600Z ^ 2025-05-07T19:55:13.0783015Z 2025-05-07T19:55:13.0784777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0786586Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0787077Z ^ 2025-05-07T19:55:13.0787398Z 2025-05-07T19:55:13.0789233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:13.0791286Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:13.0791844Z ^ 2025-05-07T19:55:13.0792338Z 2025-05-07T19:55:13.3075733Z [229/608] /github/home/miniconda/envs/build_binary/bin/c++ 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:13.3098681Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:13.9623128Z [230/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:13.9644153Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.8478060Z [231/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:15.8499067Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.5446995Z [232/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:17.5472187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5474692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5475715Z ^ 2025-05-07T19:55:17.5476206Z 2025-05-07T19:55:17.5476602Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5477695Z 2025-05-07T19:55:17.5479209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5481813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5482963Z ^ 2025-05-07T19:55:17.5483329Z 2025-05-07T19:55:17.5484727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5486950Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5487661Z ^ 2025-05-07T19:55:17.5487919Z 2025-05-07T19:55:17.5489291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison 
of unsigned integer with zero 2025-05-07T19:55:17.5491216Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5491753Z ^ 2025-05-07T19:55:17.5492028Z 2025-05-07T19:55:17.5493426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5495227Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5495769Z ^ 2025-05-07T19:55:17.5496051Z 2025-05-07T19:55:17.5497498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5499223Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5499722Z ^ 2025-05-07T19:55:17.5499995Z 2025-05-07T19:55:17.5501579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5504207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5505224Z ^ 2025-05-07T19:55:17.5505460Z 2025-05-07T19:55:17.5505863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5506485Z 2025-05-07T19:55:17.5508046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5510515Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5511632Z ^ 2025-05-07T19:55:17.5511973Z 
2025-05-07T19:55:17.5513364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5515316Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5516070Z ^ 2025-05-07T19:55:17.5516353Z 2025-05-07T19:55:17.5518052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5519887Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5520415Z ^ 2025-05-07T19:55:17.5520694Z 2025-05-07T19:55:17.5522134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5524008Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5524539Z ^ 2025-05-07T19:55:17.5524824Z 2025-05-07T19:55:17.5526270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5528392Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5528917Z ^ 2025-05-07T19:55:17.5529175Z 2025-05-07T19:55:17.5530711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5533206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5534217Z ^ 
2025-05-07T19:55:17.5534435Z 2025-05-07T19:55:17.5534838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5535401Z 2025-05-07T19:55:17.5536833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5539358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5540606Z ^ 2025-05-07T19:55:17.5540972Z 2025-05-07T19:55:17.5542394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5544418Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5545084Z ^ 2025-05-07T19:55:17.5545356Z 2025-05-07T19:55:17.5546747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5548710Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5549263Z ^ 2025-05-07T19:55:17.5549538Z 2025-05-07T19:55:17.5550918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5552705Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5553226Z ^ 2025-05-07T19:55:17.5553509Z 2025-05-07T19:55:17.5554913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning 
#186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5556738Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5557393Z ^ 2025-05-07T19:55:17.5557670Z 2025-05-07T19:55:17.5559384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5561854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5563011Z ^ 2025-05-07T19:55:17.5563248Z 2025-05-07T19:55:17.5563687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5564292Z 2025-05-07T19:55:17.5565826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5568209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5569422Z ^ 2025-05-07T19:55:17.5569776Z 2025-05-07T19:55:17.5571224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5573164Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5573913Z ^ 2025-05-07T19:55:17.5574207Z 2025-05-07T19:55:17.5575499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5577495Z if (output_d >= 0 && output_d 
< D) { 2025-05-07T19:55:17.5578035Z ^ 2025-05-07T19:55:17.5578305Z 2025-05-07T19:55:17.5579782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5581668Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5582208Z ^ 2025-05-07T19:55:17.5582482Z 2025-05-07T19:55:17.5583871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5585906Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5586444Z ^ 2025-05-07T19:55:17.5586709Z 2025-05-07T19:55:17.5588261Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5590797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5591883Z ^ 2025-05-07T19:55:17.5592117Z 2025-05-07T19:55:17.5592541Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:17.5593126Z 2025-05-07T19:55:17.5594633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.5597034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:17.5598123Z ^ 2025-05-07T19:55:17.5598464Z 2025-05-07T19:55:17.5599847Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5602075Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:17.5602781Z ^ 2025-05-07T19:55:17.5603048Z 2025-05-07T19:55:17.5604434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5606240Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5606742Z ^ 2025-05-07T19:55:17.5607005Z 2025-05-07T19:55:17.5608387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5612324Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5612885Z ^ 2025-05-07T19:55:17.5613145Z 2025-05-07T19:55:17.5614535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:17.5616412Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:17.5616965Z ^ 2025-05-07T19:55:17.5617249Z 2025-05-07T19:55:18.0554298Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:18.0573922Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:19.4652351Z 
[234/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:19.4672348Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:19.5980411Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:19.5999653Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.5601828Z [236/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:20.5622239Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:23.1523419Z [237/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:55:23.1544288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T19:55:23.1546705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1547680Z ^ 2025-05-07T19:55:23.1547923Z 2025-05-07T19:55:23.1548309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:23.1548871Z 2025-05-07T19:55:23.1550240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1552467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1553830Z ^ 2025-05-07T19:55:23.1554141Z 2025-05-07T19:55:23.1555446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1557292Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:23.1557959Z ^ 2025-05-07T19:55:23.1558203Z 2025-05-07T19:55:23.1559550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1561244Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1561720Z ^ 2025-05-07T19:55:23.1561987Z 2025-05-07T19:55:23.1563308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1564967Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1565439Z ^ 
2025-05-07T19:55:23.1565694Z 2025-05-07T19:55:23.1567029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1568700Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1569164Z ^ 2025-05-07T19:55:23.1569587Z 2025-05-07T19:55:23.1571030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1573305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1574336Z ^ 2025-05-07T19:55:23.1574563Z 2025-05-07T19:55:23.1574970Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:23.1575545Z 2025-05-07T19:55:23.1577279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1579755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1580894Z ^ 2025-05-07T19:55:23.1581212Z 2025-05-07T19:55:23.1582550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1584430Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:23.1585068Z ^ 2025-05-07T19:55:23.1585339Z 2025-05-07T19:55:23.1586671Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1588403Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1588882Z ^ 2025-05-07T19:55:23.1589156Z 2025-05-07T19:55:23.1590511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1592348Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1592868Z ^ 2025-05-07T19:55:23.1593125Z 2025-05-07T19:55:23.1594619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1596451Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1596984Z ^ 2025-05-07T19:55:23.1597248Z 2025-05-07T19:55:23.1598783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1601043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1602062Z ^ 2025-05-07T19:55:23.1602282Z 2025-05-07T19:55:23.1602659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:23.1603232Z 2025-05-07T19:55:23.1604667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1606958Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1607971Z ^ 2025-05-07T19:55:23.1608309Z 2025-05-07T19:55:23.1609869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1611722Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:23.1612362Z ^ 2025-05-07T19:55:23.1612613Z 2025-05-07T19:55:23.1613954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1615615Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1616196Z ^ 2025-05-07T19:55:23.1616455Z 2025-05-07T19:55:23.1617821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1619482Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1619975Z ^ 2025-05-07T19:55:23.1620216Z 2025-05-07T19:55:23.1621716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1623415Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1623917Z ^ 2025-05-07T19:55:23.1624160Z 2025-05-07T19:55:23.1625571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:55:23.1627928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1628959Z ^ 2025-05-07T19:55:23.1629189Z 2025-05-07T19:55:23.1629577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:23.1630267Z 2025-05-07T19:55:23.1631749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1634074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1635136Z ^ 2025-05-07T19:55:23.1635466Z 2025-05-07T19:55:23.1636826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1638713Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:23.1639395Z ^ 2025-05-07T19:55:23.1639659Z 2025-05-07T19:55:23.1641045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1642722Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1643228Z ^ 2025-05-07T19:55:23.1643479Z 2025-05-07T19:55:23.1644837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1646577Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1647245Z ^ 
2025-05-07T19:55:23.1647489Z 2025-05-07T19:55:23.1648888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1650604Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1651092Z ^ 2025-05-07T19:55:23.1651360Z 2025-05-07T19:55:23.1652857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1655260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1656406Z ^ 2025-05-07T19:55:23.1656655Z 2025-05-07T19:55:23.1657082Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:23.1657709Z 2025-05-07T19:55:23.1659316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:23.1661931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:23.1663037Z ^ 2025-05-07T19:55:23.1663378Z 2025-05-07T19:55:23.1664793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1666738Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:23.1667481Z ^ 2025-05-07T19:55:23.1667747Z 2025-05-07T19:55:23.1669151Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1671071Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1671575Z ^ 2025-05-07T19:55:23.1671855Z 2025-05-07T19:55:23.1673265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1675082Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1675593Z ^ 2025-05-07T19:55:23.1675864Z 2025-05-07T19:55:23.1678935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:23.1680753Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:23.1681254Z ^ 2025-05-07T19:55:23.1681521Z 2025-05-07T19:55:26.9923404Z [238/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:26.9947758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:55:26.9950514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:26.9951692Z ^ 2025-05-07T19:55:26.9953223Z 2025-05-07T19:55:26.9953696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.9954381Z 2025-05-07T19:55:26.9956080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9959208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:26.9960399Z ^ 2025-05-07T19:55:26.9960720Z 2025-05-07T19:55:26.9962421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9965067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:26.9966300Z ^ 2025-05-07T19:55:26.9966564Z 2025-05-07T19:55:26.9967016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.9967815Z 2025-05-07T19:55:26.9969526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9972414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:26.9973783Z ^ 2025-05-07T19:55:26.9974163Z 2025-05-07T19:55:26.9975878Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9978152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:26.9979345Z ^ 2025-05-07T19:55:26.9979610Z 2025-05-07T19:55:26.9980102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.9980935Z 2025-05-07T19:55:26.9982655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9985558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:26.9986782Z ^ 2025-05-07T19:55:26.9987186Z 2025-05-07T19:55:26.9989031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9991999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:26.9993218Z ^ 2025-05-07T19:55:26.9993521Z 2025-05-07T19:55:26.9993994Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:26.9994705Z 2025-05-07T19:55:26.9996500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:26.9999320Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:27.0000712Z ^ 2025-05-07T19:55:27.0001095Z 2025-05-07T19:55:27.0002875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:27.0005682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:27.0006954Z ^ 2025-05-07T19:55:27.0007238Z 2025-05-07T19:55:27.0007717Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:27.0008296Z 2025-05-07T19:55:27.0009831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:27.0012670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:27.0013933Z ^ 2025-05-07T19:55:27.0014348Z 2025-05-07T19:55:33.3807589Z [239/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:33.3831404Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3834377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3835597Z ^ 2025-05-07T19:55:33.3835864Z 2025-05-07T19:55:33.3836327Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.3836973Z 2025-05-07T19:55:33.3838612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3841342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3842530Z ^ 2025-05-07T19:55:33.3842921Z 2025-05-07T19:55:33.3844447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3846633Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.3847394Z ^ 2025-05-07T19:55:33.3847715Z 2025-05-07T19:55:33.3849270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3851405Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3851987Z ^ 2025-05-07T19:55:33.3852279Z 2025-05-07T19:55:33.3853871Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3855842Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3856423Z ^ 2025-05-07T19:55:33.3856713Z 2025-05-07T19:55:33.3858253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3860190Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3860871Z ^ 2025-05-07T19:55:33.3861136Z 2025-05-07T19:55:33.3862672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3865093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3866172Z ^ 2025-05-07T19:55:33.3866427Z 2025-05-07T19:55:33.3866859Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.3867548Z 2025-05-07T19:55:33.3869248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3871997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3873090Z ^ 2025-05-07T19:55:33.3873437Z 2025-05-07T19:55:33.3874907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3877579Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.3878344Z ^ 2025-05-07T19:55:33.3878627Z 2025-05-07T19:55:33.3880136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3881983Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3882527Z ^ 2025-05-07T19:55:33.3882803Z 2025-05-07T19:55:33.3884300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3886147Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3886720Z ^ 2025-05-07T19:55:33.3886984Z 2025-05-07T19:55:33.3888439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3890347Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3890889Z ^ 2025-05-07T19:55:33.3891190Z 2025-05-07T19:55:33.3892603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3895685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3896814Z ^ 2025-05-07T19:55:33.3897039Z 2025-05-07T19:55:33.3897499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.3898138Z 2025-05-07T19:55:33.3899703Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3902295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3903581Z ^ 2025-05-07T19:55:33.3903935Z 2025-05-07T19:55:33.3905409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3907427Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.3908200Z ^ 2025-05-07T19:55:33.3908496Z 2025-05-07T19:55:33.3909902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3911734Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3912274Z ^ 2025-05-07T19:55:33.3912593Z 2025-05-07T19:55:33.3914115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3915993Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3916501Z ^ 2025-05-07T19:55:33.3916769Z 2025-05-07T19:55:33.3918466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3920364Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3920944Z ^ 
2025-05-07T19:55:33.3921228Z 2025-05-07T19:55:33.3922856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3925370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3926487Z ^ 2025-05-07T19:55:33.3926731Z 2025-05-07T19:55:33.3927211Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.3927853Z 2025-05-07T19:55:33.3929442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3932014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3933164Z ^ 2025-05-07T19:55:33.3933548Z 2025-05-07T19:55:33.3935045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3937499Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.3938260Z ^ 2025-05-07T19:55:33.3938573Z 2025-05-07T19:55:33.3940071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3942149Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3942703Z ^ 2025-05-07T19:55:33.3942996Z 2025-05-07T19:55:33.3944518Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3946552Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3947106Z ^ 2025-05-07T19:55:33.3947360Z 2025-05-07T19:55:33.3948823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3950768Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3951277Z ^ 2025-05-07T19:55:33.3951550Z 2025-05-07T19:55:33.3953161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3955746Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3956879Z ^ 2025-05-07T19:55:33.3957126Z 2025-05-07T19:55:33.3957566Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:33.3958241Z 2025-05-07T19:55:33.3959903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.3962571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:33.3963802Z ^ 2025-05-07T19:55:33.3964169Z 2025-05-07T19:55:33.3965627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3967601Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:33.3968319Z ^ 2025-05-07T19:55:33.3968560Z 2025-05-07T19:55:33.3969979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3971682Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3972173Z ^ 2025-05-07T19:55:33.3972461Z 2025-05-07T19:55:33.3974018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3975805Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3976684Z ^ 2025-05-07T19:55:33.3976960Z 2025-05-07T19:55:33.3978467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:33.3980804Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:33.3981375Z ^ 2025-05-07T19:55:33.3981636Z 2025-05-07T19:55:39.6313465Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:55:39.6337200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6339699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6340938Z ^ 2025-05-07T19:55:39.6341175Z 2025-05-07T19:55:39.6341593Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.6342180Z 2025-05-07T19:55:39.6343665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6346098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6347184Z ^ 2025-05-07T19:55:39.6347531Z 2025-05-07T19:55:39.6349251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6351406Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:39.6352124Z ^ 2025-05-07T19:55:39.6352388Z 2025-05-07T19:55:39.6353904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6355951Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6356501Z ^ 
2025-05-07T19:55:39.6356731Z 2025-05-07T19:55:39.6358442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6360465Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6361036Z ^ 2025-05-07T19:55:39.6361316Z 2025-05-07T19:55:39.6362913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6364650Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6365132Z ^ 2025-05-07T19:55:39.6365364Z 2025-05-07T19:55:39.6366814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6369183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6370219Z ^ 2025-05-07T19:55:39.6370515Z 2025-05-07T19:55:39.6370977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.6371662Z 2025-05-07T19:55:39.6373407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6376699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6377751Z ^ 2025-05-07T19:55:39.6378094Z 2025-05-07T19:55:39.6379612Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6381724Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:39.6382451Z ^ 2025-05-07T19:55:39.6382712Z 2025-05-07T19:55:39.6384060Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6385821Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6386376Z ^ 2025-05-07T19:55:39.6386640Z 2025-05-07T19:55:39.6387984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6389734Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6390252Z ^ 2025-05-07T19:55:39.6390514Z 2025-05-07T19:55:39.6392446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6394539Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6395126Z ^ 2025-05-07T19:55:39.6395444Z 2025-05-07T19:55:39.6397126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6399831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6401215Z ^ 
2025-05-07T19:55:39.6401485Z 2025-05-07T19:55:39.6401983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.6402674Z 2025-05-07T19:55:39.6404387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6407021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6408001Z ^ 2025-05-07T19:55:39.6408308Z 2025-05-07T19:55:39.6409791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6412035Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:39.6412835Z ^ 2025-05-07T19:55:39.6413125Z 2025-05-07T19:55:39.6414619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6416601Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6419895Z ^ 2025-05-07T19:55:39.6420270Z 2025-05-07T19:55:39.6421769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6423681Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6424200Z ^ 2025-05-07T19:55:39.6424510Z 2025-05-07T19:55:39.6425923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6427628Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6428119Z ^ 2025-05-07T19:55:39.6428365Z 2025-05-07T19:55:39.6429826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6432344Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6433457Z ^ 2025-05-07T19:55:39.6433704Z 2025-05-07T19:55:39.6434119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.6434693Z 2025-05-07T19:55:39.6436427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6438934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6440086Z ^ 2025-05-07T19:55:39.6440522Z 2025-05-07T19:55:39.6441984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6444246Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:39.6445024Z ^ 2025-05-07T19:55:39.6445541Z 2025-05-07T19:55:39.6447225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6449228Z if (output_d >= 0 
&& output_d < D) { 2025-05-07T19:55:39.6449793Z ^ 2025-05-07T19:55:39.6450081Z 2025-05-07T19:55:39.6451662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6453632Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6454221Z ^ 2025-05-07T19:55:39.6454502Z 2025-05-07T19:55:39.6456067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6458023Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6458597Z ^ 2025-05-07T19:55:39.6458883Z 2025-05-07T19:55:39.6460700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6463372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6464820Z ^ 2025-05-07T19:55:39.6465082Z 2025-05-07T19:55:39.6465535Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:39.6466251Z 2025-05-07T19:55:39.6468111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:39.6470907Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:39.6472138Z ^ 2025-05-07T19:55:39.6472537Z 2025-05-07T19:55:39.6474089Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6476350Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:55:39.6476965Z ^ 2025-05-07T19:55:39.6477192Z 2025-05-07T19:55:39.6478453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6480377Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6480964Z ^ 2025-05-07T19:55:39.6481238Z 2025-05-07T19:55:39.6483036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6484997Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6485624Z ^ 2025-05-07T19:55:39.6485908Z 2025-05-07T19:55:39.6487188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:39.6488999Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:39.6489513Z ^ 2025-05-07T19:55:39.6489972Z 2025-05-07T19:55:57.2050809Z [241/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:57.2074806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2077816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2079042Z ^ 2025-05-07T19:55:57.2079332Z 2025-05-07T19:55:57.2079785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:57.2080453Z 2025-05-07T19:55:57.2082584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2085360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2086562Z ^ 2025-05-07T19:55:57.2086929Z 2025-05-07T19:55:57.2088645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2091346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2092833Z ^ 2025-05-07T19:55:57.2093104Z 2025-05-07T19:55:57.2093618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:57.2094311Z 2025-05-07T19:55:57.2095774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2098097Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2099082Z ^ 2025-05-07T19:55:57.2099386Z 2025-05-07T19:55:57.2100853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2103225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2104349Z ^ 2025-05-07T19:55:57.2104630Z 2025-05-07T19:55:57.2105070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:57.2105718Z 2025-05-07T19:55:57.2107304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2110141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2111267Z ^ 2025-05-07T19:55:57.2111592Z 2025-05-07T19:55:57.2113111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2115755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2117043Z ^ 2025-05-07T19:55:57.2117297Z 2025-05-07T19:55:57.2117727Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:57.2118368Z 2025-05-07T19:55:57.2119926Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2122538Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2123691Z ^ 2025-05-07T19:55:57.2124033Z 2025-05-07T19:55:57.2125688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2128320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2129474Z ^ 2025-05-07T19:55:57.2129734Z 2025-05-07T19:55:57.2130164Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:57.2130709Z 2025-05-07T19:55:57.2132445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:57.2134906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:57.2136090Z ^ 2025-05-07T19:55:57.2136462Z 2025-05-07T19:55:59.1383033Z [242/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d 
-x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:59.1406876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1409966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1411142Z ^ 2025-05-07T19:55:59.1411400Z 2025-05-07T19:55:59.1411863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.1412530Z 2025-05-07T19:55:59.1414256Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1416947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1418123Z ^ 2025-05-07T19:55:59.1418612Z 2025-05-07T19:55:59.1420249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1422401Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1422969Z ^ 2025-05-07T19:55:59.1423267Z 2025-05-07T19:55:59.1424869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of 
unsigned integer with zero 2025-05-07T19:55:59.1426871Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1427420Z ^ 2025-05-07T19:55:59.1427732Z 2025-05-07T19:55:59.1429298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1431313Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1431869Z ^ 2025-05-07T19:55:59.1432199Z 2025-05-07T19:55:59.1433831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1436597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1437778Z ^ 2025-05-07T19:55:59.1438028Z 2025-05-07T19:55:59.1438468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.1439120Z 2025-05-07T19:55:59.1440812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1443211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1444271Z ^ 2025-05-07T19:55:59.1444640Z 2025-05-07T19:55:59.1446266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1448289Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1448802Z ^ 
2025-05-07T19:55:59.1449101Z 2025-05-07T19:55:59.1450615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1452598Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1453139Z ^ 2025-05-07T19:55:59.1453450Z 2025-05-07T19:55:59.1455246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1457273Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1457834Z ^ 2025-05-07T19:55:59.1458136Z 2025-05-07T19:55:59.1459748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1462510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1463769Z ^ 2025-05-07T19:55:59.1464019Z 2025-05-07T19:55:59.1464483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.1465168Z 2025-05-07T19:55:59.1466847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1469564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1470789Z ^ 2025-05-07T19:55:59.1471158Z 2025-05-07T19:55:59.1472762Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1474823Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1475406Z ^ 2025-05-07T19:55:59.1475732Z 2025-05-07T19:55:59.1477646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1479807Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1480353Z ^ 2025-05-07T19:55:59.1480669Z 2025-05-07T19:55:59.1482238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1484277Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1484833Z ^ 2025-05-07T19:55:59.1485130Z 2025-05-07T19:55:59.1486384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1488957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1490038Z ^ 2025-05-07T19:55:59.1490274Z 2025-05-07T19:55:59.1490693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.1491329Z 2025-05-07T19:55:59.1492889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:55:59.1495376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1496493Z ^ 2025-05-07T19:55:59.1496864Z 2025-05-07T19:55:59.1498597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1500652Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1501205Z ^ 2025-05-07T19:55:59.1501513Z 2025-05-07T19:55:59.1503033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1505000Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1505564Z ^ 2025-05-07T19:55:59.1505984Z 2025-05-07T19:55:59.1507584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1509562Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1510143Z ^ 2025-05-07T19:55:59.1510442Z 2025-05-07T19:55:59.1512144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1514681Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1515827Z ^ 2025-05-07T19:55:59.1516075Z 2025-05-07T19:55:59.1516515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:55:59.1517217Z 
2025-05-07T19:55:59.1518755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:59.1521300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:55:59.1522540Z ^ 2025-05-07T19:55:59.1522907Z 2025-05-07T19:55:59.1524460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1526599Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1542206Z ^ 2025-05-07T19:55:59.1542572Z 2025-05-07T19:55:59.1544182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1546178Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1546760Z ^ 2025-05-07T19:55:59.1547066Z 2025-05-07T19:55:59.1548423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:59.1550428Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:55:59.1550999Z ^ 2025-05-07T19:55:59.1551328Z 2025-05-07T19:56:06.7751710Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d 
-x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:56:06.7776187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7779257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7780570Z ^ 2025-05-07T19:56:06.7780808Z 2025-05-07T19:56:06.7781246Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.7781935Z 2025-05-07T19:56:06.7783653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7786324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7787518Z ^ 2025-05-07T19:56:06.7787883Z 2025-05-07T19:56:06.7789664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7792407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7793623Z ^ 2025-05-07T19:56:06.7793890Z 2025-05-07T19:56:06.7794356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.7795034Z 2025-05-07T19:56:06.7797171Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7799901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7801134Z ^ 2025-05-07T19:56:06.7801534Z 2025-05-07T19:56:06.7802981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7805583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7806906Z ^ 2025-05-07T19:56:06.7807177Z 2025-05-07T19:56:06.7807638Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.7808305Z 2025-05-07T19:56:06.7809974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7812517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7813704Z ^ 2025-05-07T19:56:06.7814071Z 2025-05-07T19:56:06.7815760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7818463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7819659Z ^ 
2025-05-07T19:56:06.7819916Z 2025-05-07T19:56:06.7820490Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.7821144Z 2025-05-07T19:56:06.7822706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7825293Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7826315Z ^ 2025-05-07T19:56:06.7826663Z 2025-05-07T19:56:06.7828329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7831179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7832388Z ^ 2025-05-07T19:56:06.7832659Z 2025-05-07T19:56:06.7833132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:06.7833819Z 2025-05-07T19:56:06.7835576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:06.7838294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:06.7839509Z ^ 2025-05-07T19:56:06.7839884Z 2025-05-07T19:56:10.1624337Z [244/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:56:10.1648234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1651014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1652238Z ^ 2025-05-07T19:56:10.1652502Z 2025-05-07T19:56:10.1652957Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.1653656Z 2025-05-07T19:56:10.1655400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1658157Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1659378Z ^ 2025-05-07T19:56:10.1659750Z 2025-05-07T19:56:10.1661781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1664588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1665753Z ^ 2025-05-07T19:56:10.1666213Z 
2025-05-07T19:56:10.1666672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.1667335Z 2025-05-07T19:56:10.1668996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1671585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1672957Z ^ 2025-05-07T19:56:10.1673349Z 2025-05-07T19:56:10.1675126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1678368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1679602Z ^ 2025-05-07T19:56:10.1679883Z 2025-05-07T19:56:10.1680351Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.1681066Z 2025-05-07T19:56:10.1682864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1685698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1686970Z ^ 2025-05-07T19:56:10.1687361Z 2025-05-07T19:56:10.1689125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1691922Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1693335Z ^ 2025-05-07T19:56:10.1693600Z 2025-05-07T19:56:10.1694070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.1694801Z 2025-05-07T19:56:10.1696573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1699300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1700713Z ^ 2025-05-07T19:56:10.1701129Z 2025-05-07T19:56:10.1702714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1705323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1706532Z ^ 2025-05-07T19:56:10.1706822Z 2025-05-07T19:56:10.1707285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:10.1707980Z 2025-05-07T19:56:10.1709736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:10.1714190Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:10.1715515Z ^ 2025-05-07T19:56:10.1715854Z 2025-05-07T19:56:12.8261647Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:56:12.8285081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.8287586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.8288589Z ^ 2025-05-07T19:56:12.8288806Z 2025-05-07T19:56:12.8289223Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:12.8289879Z 2025-05-07T19:56:12.8291449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.8294305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.8295492Z ^ 2025-05-07T19:56:12.8295879Z 2025-05-07T19:56:12.8297145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8299433Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8300477Z ^ 2025-05-07T19:56:12.8304076Z detected during 
instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:12.8307226Z 2025-05-07T19:56:12.8308487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8310406Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8311420Z ^ 2025-05-07T19:56:12.8314770Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:12.8317921Z 2025-05-07T19:56:12.8319202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8321074Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8321911Z ^ 2025-05-07T19:56:12.8325472Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, 
index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:12.8328847Z 2025-05-07T19:56:12.8330133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8332194Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8333108Z ^ 2025-05-07T19:56:12.8336304Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:12.8339299Z 2025-05-07T19:56:12.8340594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8342539Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8343616Z ^ 2025-05-07T19:56:12.8346943Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:12.8349989Z 2025-05-07T19:56:12.8351301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8353389Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8354296Z ^ 2025-05-07T19:56:12.8357566Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:12.8360694Z 2025-05-07T19:56:12.8361991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8363963Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8364848Z ^ 2025-05-07T19:56:12.8368450Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:12.8371700Z 2025-05-07T19:56:12.8372901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8374886Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8375811Z ^ 2025-05-07T19:56:12.8379419Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, 
const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:12.8382784Z 2025-05-07T19:56:12.8384141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8386140Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8387058Z ^ 2025-05-07T19:56:12.8390773Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:12.8393950Z 2025-05-07T19:56:12.8395289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8397281Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8398408Z ^ 2025-05-07T19:56:12.8401651Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:12.8404745Z 2025-05-07T19:56:12.8406023Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8408047Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8408976Z ^ 2025-05-07T19:56:12.8412474Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:12.8415874Z 2025-05-07T19:56:12.8417405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8419426Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8420348Z ^ 2025-05-07T19:56:12.8423790Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:12.8426075Z 2025-05-07T19:56:12.8427289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8429051Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8429848Z ^ 2025-05-07T19:56:12.8433236Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:12.8436601Z 2025-05-07T19:56:12.8437940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8439811Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8440666Z ^ 2025-05-07T19:56:12.8444192Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:12.8447498Z 2025-05-07T19:56:12.8448869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8450957Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8451895Z ^ 2025-05-07T19:56:12.8455472Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, 
output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:12.8458882Z 2025-05-07T19:56:12.8460147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8462324Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8463228Z ^ 2025-05-07T19:56:12.8466957Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:12.8470185Z 2025-05-07T19:56:12.8471528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8473634Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8474568Z ^ 2025-05-07T19:56:12.8478688Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:12.8482071Z 2025-05-07T19:56:12.8483415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:12.8485412Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8486365Z ^ 2025-05-07T19:56:12.8489918Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:12.8493367Z 2025-05-07T19:56:12.8494681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8497235Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8498201Z ^ 2025-05-07T19:56:12.8501807Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:12.8505221Z 2025-05-07T19:56:12.8506589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8508845Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8509817Z ^ 2025-05-07T19:56:12.8513484Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:12.8516879Z 2025-05-07T19:56:12.8518233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8520320Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8521287Z ^ 2025-05-07T19:56:12.8525216Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:12.8528592Z 2025-05-07T19:56:12.8529952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8532184Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8533150Z ^ 2025-05-07T19:56:12.8536887Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:12.8540554Z 2025-05-07T19:56:12.8541954Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8544101Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8545063Z ^ 2025-05-07T19:56:12.8548789Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:12.8552277Z 2025-05-07T19:56:12.8553624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8555559Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8556619Z ^ 2025-05-07T19:56:12.8560282Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:12.8563957Z 2025-05-07T19:56:12.8565781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.8568709Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.8569982Z ^ 2025-05-07T19:56:12.8570252Z 2025-05-07T19:56:12.8570747Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:56:12.8571469Z 2025-05-07T19:56:12.8573259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.8577148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.8578458Z ^ 2025-05-07T19:56:12.8578860Z 2025-05-07T19:56:12.8580255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8582465Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8583445Z ^ 2025-05-07T19:56:12.8587097Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:12.8590602Z 2025-05-07T19:56:12.8591993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8594166Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8595266Z ^ 2025-05-07T19:56:12.8598676Z detected during instantiation of "void
split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:12.8602136Z 2025-05-07T19:56:12.8603526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8605839Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8606884Z ^ 2025-05-07T19:56:12.8610496Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:12.8613831Z 2025-05-07T19:56:12.8615349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8617458Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8618407Z ^ 2025-05-07T19:56:12.8622303Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, 
index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:12.8625942Z 2025-05-07T19:56:12.8627463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8629574Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8630541Z ^ 2025-05-07T19:56:12.8633949Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:12.8637594Z 2025-05-07T19:56:12.8639022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8641160Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8642148Z ^ 2025-05-07T19:56:12.8645790Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:12.8649398Z 2025-05-07T19:56:12.8650829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8653004Z 
reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8654093Z ^ 2025-05-07T19:56:12.8657859Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:12.8661431Z 2025-05-07T19:56:12.8662831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8664993Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8665985Z ^ 2025-05-07T19:56:12.8669701Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:12.8673057Z 2025-05-07T19:56:12.8674632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8676916Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8677885Z ^ 2025-05-07T19:56:12.8681506Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, 
fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:12.8685012Z 2025-05-07T19:56:12.8686389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8688492Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8689436Z ^ 2025-05-07T19:56:12.8693210Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:12.8696580Z 2025-05-07T19:56:12.8697954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8700020Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8701076Z ^ 2025-05-07T19:56:12.8704823Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:12.8708552Z 2025-05-07T19:56:12.8709950Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8712118Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8713068Z ^ 2025-05-07T19:56:12.8716708Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:12.8720215Z 2025-05-07T19:56:12.8721649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8724041Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8725042Z ^ 2025-05-07T19:56:12.8728844Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:12.8732168Z 2025-05-07T19:56:12.8733593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8735858Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8736819Z ^ 
2025-05-07T19:56:12.8740541Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:12.8744069Z 2025-05-07T19:56:12.8745454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8747749Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8748699Z ^ 2025-05-07T19:56:12.8752281Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:12.8755496Z 2025-05-07T19:56:12.8756868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8759014Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8759843Z ^ 2025-05-07T19:56:12.8763160Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, 
output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:12.8766510Z 2025-05-07T19:56:12.8768069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8770213Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8771365Z ^ 2025-05-07T19:56:12.8775197Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:12.8778758Z 2025-05-07T19:56:12.8780026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8782230Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8783203Z ^ 2025-05-07T19:56:12.8786839Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:12.8790256Z 2025-05-07T19:56:12.8791587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer 
conversion resulted in a change of sign 2025-05-07T19:56:12.8793605Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8794512Z ^ 2025-05-07T19:56:12.8797977Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:12.8801433Z 2025-05-07T19:56:12.8802733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8804711Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8805641Z ^ 2025-05-07T19:56:12.8809276Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:12.8812384Z 2025-05-07T19:56:12.8813499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8815315Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8816081Z ^ 2025-05-07T19:56:12.8821677Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t 
*, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:12.8824225Z 2025-05-07T19:56:12.8825357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8827356Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8828204Z ^ 2025-05-07T19:56:12.8831649Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:12.8834816Z 2025-05-07T19:56:12.8836094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8838225Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8839152Z ^ 2025-05-07T19:56:12.8842615Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:12.8846035Z 
2025-05-07T19:56:12.8847285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8849269Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8850179Z ^ 2025-05-07T19:56:12.8853560Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:12.8856779Z 2025-05-07T19:56:12.8858496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.8861347Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.8862546Z ^ 2025-05-07T19:56:12.8862809Z 2025-05-07T19:56:12.8863421Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T19:56:12.8864089Z 2025-05-07T19:56:12.8865952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.8868657Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.8869812Z ^ 2025-05-07T19:56:12.8870136Z 2025-05-07T19:56:12.8871276Z
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8873207Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8874098Z ^ 2025-05-07T19:56:12.8877804Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:12.8880744Z 2025-05-07T19:56:12.8882065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8883984Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8884913Z ^ 2025-05-07T19:56:12.8888227Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:12.8891643Z 2025-05-07T19:56:12.8892888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8894841Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8895675Z ^ 2025-05-07T19:56:12.8898993Z detected 
during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:12.8902180Z 2025-05-07T19:56:12.8903443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8905355Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8906236Z ^ 2025-05-07T19:56:12.8909916Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:12.8913153Z 2025-05-07T19:56:12.8914388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8916365Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8917409Z ^ 2025-05-07T19:56:12.8920844Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:12.8924003Z 2025-05-07T19:56:12.8925366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8927042Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8927854Z ^ 2025-05-07T19:56:12.8931244Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:12.8934628Z 2025-05-07T19:56:12.8935971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8938027Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8938947Z ^ 2025-05-07T19:56:12.8942527Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:12.8946071Z 2025-05-07T19:56:12.8947367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:12.8949246Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8950158Z ^ 2025-05-07T19:56:12.8953853Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:12.8957164Z 2025-05-07T19:56:12.8958492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8960500Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8961429Z ^ 2025-05-07T19:56:12.8965139Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:12.8968275Z 2025-05-07T19:56:12.8969571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8971174Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8971936Z ^ 2025-05-07T19:56:12.8975209Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:12.8978610Z 2025-05-07T19:56:12.8979865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8982009Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8982930Z ^ 2025-05-07T19:56:12.8986198Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:12.8989358Z 2025-05-07T19:56:12.8990630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.8992671Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.8993587Z ^ 2025-05-07T19:56:12.8997450Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:12.9000612Z 2025-05-07T19:56:12.9001905Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9003869Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9004759Z ^ 2025-05-07T19:56:12.9008270Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:12.9011665Z 2025-05-07T19:56:12.9013012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9015021Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9015890Z ^ 2025-05-07T19:56:12.9019163Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:12.9022577Z 2025-05-07T19:56:12.9023898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9026057Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9026952Z ^ 2025-05-07T19:56:12.9030451Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:12.9033672Z 2025-05-07T19:56:12.9035010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9037046Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9037965Z ^ 2025-05-07T19:56:12.9041670Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:12.9044821Z 2025-05-07T19:56:12.9046017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9048149Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9049087Z ^ 2025-05-07T19:56:12.9052509Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with 
emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:12.9055938Z 2025-05-07T19:56:12.9057306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9059219Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9060079Z ^ 2025-05-07T19:56:12.9063524Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:12.9066985Z 2025-05-07T19:56:12.9068308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9070428Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9071376Z ^ 2025-05-07T19:56:12.9074892Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:12.9078419Z 2025-05-07T19:56:12.9079703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted 
in a change of sign 2025-05-07T19:56:12.9081722Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9082679Z ^ 2025-05-07T19:56:12.9086196Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:12.9089815Z 2025-05-07T19:56:12.9091154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9093146Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9093921Z ^ 2025-05-07T19:56:12.9097213Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:12.9100824Z 2025-05-07T19:56:12.9102142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9104207Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9105105Z ^ 2025-05-07T19:56:12.9108498Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, 
uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:12.9111619Z 2025-05-07T19:56:12.9112913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9114820Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9115918Z ^ 2025-05-07T19:56:12.9119508Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:12.9122769Z 2025-05-07T19:56:12.9124011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9125992Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9126891Z ^ 2025-05-07T19:56:12.9130424Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:12.9133608Z 
2025-05-07T19:56:12.9135513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.9138246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.9139463Z ^ 2025-05-07T19:56:12.9139723Z 2025-05-07T19:56:12.9140193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:12.9141033Z 2025-05-07T19:56:12.9142707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.9145576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.9146680Z ^ 2025-05-07T19:56:12.9147049Z 2025-05-07T19:56:12.9148295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9150269Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9151174Z ^ 2025-05-07T19:56:12.9154800Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:12.9158400Z 2025-05-07T19:56:12.9159729Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9162003Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9162930Z ^ 2025-05-07T19:56:12.9166622Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:12.9170132Z 2025-05-07T19:56:12.9171495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9173470Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9174281Z ^ 2025-05-07T19:56:12.9177877Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:12.9181328Z 2025-05-07T19:56:12.9183023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9185091Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9186026Z ^ 2025-05-07T19:56:12.9189566Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:12.9193018Z 2025-05-07T19:56:12.9194386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9196436Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9197360Z ^ 2025-05-07T19:56:12.9200889Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:12.9204020Z 2025-05-07T19:56:12.9204962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9206614Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9207724Z ^ 2025-05-07T19:56:12.9210656Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, 
cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:12.9213760Z 2025-05-07T19:56:12.9215107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9217222Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9218158Z ^ 2025-05-07T19:56:12.9221682Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:12.9224836Z 2025-05-07T19:56:12.9226333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9228417Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9229362Z ^ 2025-05-07T19:56:12.9232946Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:12.9236361Z 2025-05-07T19:56:12.9238005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:12.9239933Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9240786Z ^ 2025-05-07T19:56:12.9244251Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:12.9247882Z 2025-05-07T19:56:12.9249219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9251227Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9252160Z ^ 2025-05-07T19:56:12.9255950Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:12.9259528Z 2025-05-07T19:56:12.9261033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9263137Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9264096Z ^ 2025-05-07T19:56:12.9267756Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:12.9271068Z 2025-05-07T19:56:12.9272392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9274660Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9275604Z ^ 2025-05-07T19:56:12.9279305Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:12.9282581Z 2025-05-07T19:56:12.9283932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9286214Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9287181Z ^ 2025-05-07T19:56:12.9290997Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:12.9294339Z 2025-05-07T19:56:12.9295696Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9297797Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9298758Z ^ 2025-05-07T19:56:12.9302449Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:12.9305973Z 2025-05-07T19:56:12.9307332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9309625Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9310594Z ^ 2025-05-07T19:56:12.9314300Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:12.9317750Z 2025-05-07T19:56:12.9319130Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9321271Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9322442Z ^ 
2025-05-07T19:56:12.9326175Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:12.9329621Z 2025-05-07T19:56:12.9331017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9333257Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9334235Z ^ 2025-05-07T19:56:12.9337736Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:12.9341164Z 2025-05-07T19:56:12.9342542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9344696Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9345675Z ^ 2025-05-07T19:56:12.9349503Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t 
*, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:12.9353123Z 2025-05-07T19:56:12.9354550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9356687Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9357675Z ^ 2025-05-07T19:56:12.9361452Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:12.9364935Z 2025-05-07T19:56:12.9366360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9368493Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9369462Z ^ 2025-05-07T19:56:12.9373388Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:12.9377166Z 2025-05-07T19:56:12.9378527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: 
integer conversion resulted in a change of sign 2025-05-07T19:56:12.9380342Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9381581Z ^ 2025-05-07T19:56:12.9385512Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:12.9388988Z 2025-05-07T19:56:12.9390387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9392563Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9393551Z ^ 2025-05-07T19:56:12.9397282Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:12.9400971Z 2025-05-07T19:56:12.9402356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9404497Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9405457Z ^ 2025-05-07T19:56:12.9409356Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t 
*, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:12.9412842Z 2025-05-07T19:56:12.9414236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9416376Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9417308Z ^ 2025-05-07T19:56:12.9421260Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:12.9424809Z 2025-05-07T19:56:12.9426632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:12.9429531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.9430873Z ^ 2025-05-07T19:56:12.9431155Z 2025-05-07T19:56:12.9431632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:12.9432360Z 2025-05-07T19:56:12.9434203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:56:12.9437093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:12.9438392Z ^ 2025-05-07T19:56:12.9438786Z 2025-05-07T19:56:12.9440183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9442338Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9443321Z ^ 2025-05-07T19:56:12.9446962Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 946 2025-05-07T19:56:12.9450486Z 2025-05-07T19:56:12.9451881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9454040Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9455060Z ^ 2025-05-07T19:56:12.9458677Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 996 2025-05-07T19:56:12.9462156Z 2025-05-07T19:56:12.9463608Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9465799Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9466810Z ^ 2025-05-07T19:56:12.9470791Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1046 2025-05-07T19:56:12.9474206Z 2025-05-07T19:56:12.9475575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9478078Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9479178Z ^ 2025-05-07T19:56:12.9482832Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" at line 1096 2025-05-07T19:56:12.9486207Z 2025-05-07T19:56:12.9487562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9489665Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9490634Z ^ 2025-05-07T19:56:12.9494258Z 
detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1146 2025-05-07T19:56:12.9497903Z 2025-05-07T19:56:12.9499240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9501279Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9502275Z ^ 2025-05-07T19:56:12.9506267Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1196 2025-05-07T19:56:12.9509618Z 2025-05-07T19:56:12.9511036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9513146Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9514157Z ^ 2025-05-07T19:56:12.9518112Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, 
cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1246 2025-05-07T19:56:12.9521648Z 2025-05-07T19:56:12.9523080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9525246Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9526231Z ^ 2025-05-07T19:56:12.9530211Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" at line 1296 2025-05-07T19:56:12.9533645Z 2025-05-07T19:56:12.9535064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9537418Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9538433Z ^ 2025-05-07T19:56:12.9542048Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1346 2025-05-07T19:56:12.9545669Z 2025-05-07T19:56:12.9546895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 
2025-05-07T19:56:12.9548859Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9549846Z ^ 2025-05-07T19:56:12.9553607Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1396 2025-05-07T19:56:12.9557087Z 2025-05-07T19:56:12.9558492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9560793Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9561756Z ^ 2025-05-07T19:56:12.9565464Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1446 2025-05-07T19:56:12.9568424Z 2025-05-07T19:56:12.9569687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9571704Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9572619Z ^ 2025-05-07T19:56:12.9576512Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, 
uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" at line 1496 2025-05-07T19:56:12.9579983Z 2025-05-07T19:56:12.9581399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9583365Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9584281Z ^ 2025-05-07T19:56:12.9587723Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1546 2025-05-07T19:56:12.9590887Z 2025-05-07T19:56:12.9592230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9594393Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9595340Z ^ 2025-05-07T19:56:12.9598649Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1596 2025-05-07T19:56:12.9601655Z 2025-05-07T19:56:12.9602680Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9604384Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9605104Z ^ 2025-05-07T19:56:12.9608315Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1646 2025-05-07T19:56:12.9610973Z 2025-05-07T19:56:12.9611949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9613603Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9614400Z ^ 2025-05-07T19:56:12.9617723Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" at line 1696 2025-05-07T19:56:12.9621048Z 2025-05-07T19:56:12.9622264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9624186Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9625078Z ^ 
2025-05-07T19:56:12.9628391Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1746 2025-05-07T19:56:12.9631520Z 2025-05-07T19:56:12.9632738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9634697Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9635564Z ^ 2025-05-07T19:56:12.9638932Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1796 2025-05-07T19:56:12.9641934Z 2025-05-07T19:56:12.9643211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9645202Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9646216Z ^ 2025-05-07T19:56:12.9649920Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t 
*, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1846 2025-05-07T19:56:12.9653171Z 2025-05-07T19:56:12.9654649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9656587Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9657387Z ^ 2025-05-07T19:56:12.9660768Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" at line 1896 2025-05-07T19:56:12.9664188Z 2025-05-07T19:56:12.9665369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9667295Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9668223Z ^ 2025-05-07T19:56:12.9671785Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1946 2025-05-07T19:56:12.9674917Z 2025-05-07T19:56:12.9676445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: 
integer conversion resulted in a change of sign 2025-05-07T19:56:12.9678286Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9679314Z ^ 2025-05-07T19:56:12.9682674Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 1996 2025-05-07T19:56:12.9685801Z 2025-05-07T19:56:12.9687397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9689184Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9690088Z ^ 2025-05-07T19:56:12.9693540Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2046 2025-05-07T19:56:12.9696779Z 2025-05-07T19:56:12.9698458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:56:12.9700628Z reinterpret_cast(&lxu_cache_weights[cache_idx * max_D_cache]); 2025-05-07T19:56:12.9701555Z ^ 2025-05-07T19:56:12.9705174Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const 
cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" at line 2096 2025-05-07T19:56:12.9708870Z 2025-05-07T19:56:19.6854223Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:56:19.6877882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6880830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6882086Z ^ 2025-05-07T19:56:19.6882358Z 2025-05-07T19:56:19.6882832Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:19.6883925Z 2025-05-07T19:56:19.6885523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6888233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6889446Z ^ 2025-05-07T19:56:19.6889833Z 2025-05-07T19:56:19.6891524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6894145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6895110Z ^ 2025-05-07T19:56:19.6895326Z 2025-05-07T19:56:19.6895713Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:19.6896321Z 2025-05-07T19:56:19.6897750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6900269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6901428Z ^ 2025-05-07T19:56:19.6901773Z 2025-05-07T19:56:19.6903346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6905819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6907030Z ^ 2025-05-07T19:56:19.6907324Z 2025-05-07T19:56:19.6907772Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:19.6908660Z 2025-05-07T19:56:19.6910296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6912819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6914017Z ^ 2025-05-07T19:56:19.6914393Z 2025-05-07T19:56:19.6916053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6918647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6919881Z ^ 2025-05-07T19:56:19.6920145Z 2025-05-07T19:56:19.6920579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:19.6921253Z 2025-05-07T19:56:19.6922968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6925682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6926872Z ^ 2025-05-07T19:56:19.6927264Z 2025-05-07T19:56:19.6929033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6931709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6932852Z ^ 2025-05-07T19:56:19.6933145Z 2025-05-07T19:56:19.6933605Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:19.6934293Z 2025-05-07T19:56:19.6935980Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:19.6938745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:19.6940110Z ^ 2025-05-07T19:56:19.6940654Z 2025-05-07T19:56:30.5822393Z [247/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode 
arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:56:30.5846948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5849541Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5850681Z ^ 2025-05-07T19:56:30.5850959Z 2025-05-07T19:56:30.5851437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.5852080Z 2025-05-07T19:56:30.5853364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:56:30.5855808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5857217Z ^ 2025-05-07T19:56:30.5857585Z 2025-05-07T19:56:30.5859208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5861406Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5862032Z ^ 2025-05-07T19:56:30.5862340Z 2025-05-07T19:56:30.5863933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5865969Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5866553Z ^ 2025-05-07T19:56:30.5866895Z 2025-05-07T19:56:30.5868503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5870448Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5871042Z ^ 2025-05-07T19:56:30.5871375Z 2025-05-07T19:56:30.5873059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5878440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5879576Z ^ 2025-05-07T19:56:30.5879837Z 2025-05-07T19:56:30.5880305Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:56:30.5880972Z 2025-05-07T19:56:30.5882664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5885388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5886611Z ^ 2025-05-07T19:56:30.5886960Z 2025-05-07T19:56:30.5888530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5890572Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5891144Z ^ 2025-05-07T19:56:30.5891452Z 2025-05-07T19:56:30.5893065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5895319Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5895888Z ^ 2025-05-07T19:56:30.5896183Z 2025-05-07T19:56:30.5897788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5899474Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5900003Z ^ 2025-05-07T19:56:30.5900275Z 2025-05-07T19:56:30.5901925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5904773Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5905960Z ^ 2025-05-07T19:56:30.5906216Z 2025-05-07T19:56:30.5906653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.5907301Z 2025-05-07T19:56:30.5908930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5911622Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5912675Z ^ 2025-05-07T19:56:30.5913143Z 2025-05-07T19:56:30.5914803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5916868Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5917386Z ^ 2025-05-07T19:56:30.5917686Z 2025-05-07T19:56:30.5919150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5921278Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5921856Z ^ 2025-05-07T19:56:30.5922169Z 2025-05-07T19:56:30.5923909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5925845Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5926420Z ^ 2025-05-07T19:56:30.5926713Z 2025-05-07T19:56:30.5928336Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5930934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5932095Z ^ 2025-05-07T19:56:30.5932337Z 2025-05-07T19:56:30.5932760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.5933419Z 2025-05-07T19:56:30.5934914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5937305Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5938400Z ^ 2025-05-07T19:56:30.5938691Z 2025-05-07T19:56:30.5939956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5941966Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5942482Z ^ 2025-05-07T19:56:30.5942919Z 2025-05-07T19:56:30.5944181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5946034Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5946560Z ^ 2025-05-07T19:56:30.5946846Z 2025-05-07T19:56:30.5948340Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5950065Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5950618Z ^ 2025-05-07T19:56:30.5950889Z 2025-05-07T19:56:30.5952323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5954810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5955982Z ^ 2025-05-07T19:56:30.5956238Z 2025-05-07T19:56:30.5956698Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:30.5957361Z 2025-05-07T19:56:30.5959037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:30.5961881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:30.5963026Z ^ 2025-05-07T19:56:30.5963410Z 2025-05-07T19:56:30.5964909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5966887Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5967461Z ^ 2025-05-07T19:56:30.5967761Z 2025-05-07T19:56:30.5969252Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5971142Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5971728Z ^ 2025-05-07T19:56:30.5972010Z 2025-05-07T19:56:30.5973706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:30.5975623Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:30.5976494Z ^ 2025-05-07T19:56:30.5976773Z 2025-05-07T19:56:39.7354656Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:39.7379358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7382123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7383452Z ^ 2025-05-07T19:56:39.7383720Z 2025-05-07T19:56:39.7384217Z Remark: The warnings 
can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.7384926Z 2025-05-07T19:56:39.7386678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7389510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7390718Z ^ 2025-05-07T19:56:39.7391085Z 2025-05-07T19:56:39.7392708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7394960Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.7395741Z ^ 2025-05-07T19:56:39.7396053Z 2025-05-07T19:56:39.7398118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7400157Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7400735Z ^ 2025-05-07T19:56:39.7401048Z 2025-05-07T19:56:39.7402765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7404770Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7405334Z ^ 2025-05-07T19:56:39.7405692Z 2025-05-07T19:56:39.7407242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:39.7409203Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7409746Z ^ 2025-05-07T19:56:39.7410026Z 2025-05-07T19:56:39.7411706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7414413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7415524Z ^ 2025-05-07T19:56:39.7415782Z 2025-05-07T19:56:39.7416234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.7416945Z 2025-05-07T19:56:39.7418652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7421498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7422849Z ^ 2025-05-07T19:56:39.7423157Z 2025-05-07T19:56:39.7424363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7426031Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.7426642Z ^ 2025-05-07T19:56:39.7426929Z 2025-05-07T19:56:39.7428117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7429655Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7430259Z ^ 
2025-05-07T19:56:39.7430498Z 2025-05-07T19:56:39.7431688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7433223Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7433721Z ^ 2025-05-07T19:56:39.7433968Z 2025-05-07T19:56:39.7435227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7436876Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7437374Z ^ 2025-05-07T19:56:39.7437637Z 2025-05-07T19:56:39.7439307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7441725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7442830Z ^ 2025-05-07T19:56:39.7443052Z 2025-05-07T19:56:39.7443445Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.7444073Z 2025-05-07T19:56:39.7445628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7448207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7449368Z ^ 2025-05-07T19:56:39.7449763Z 2025-05-07T19:56:39.7464769Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7467056Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.7467871Z ^ 2025-05-07T19:56:39.7468176Z 2025-05-07T19:56:39.7469840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7471934Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7472539Z ^ 2025-05-07T19:56:39.7472841Z 2025-05-07T19:56:39.7474527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7477017Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7477613Z ^ 2025-05-07T19:56:39.7477901Z 2025-05-07T19:56:39.7479539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7481652Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7482225Z ^ 2025-05-07T19:56:39.7482541Z 2025-05-07T19:56:39.7484318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7487259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7488457Z ^ 
2025-05-07T19:56:39.7488741Z 2025-05-07T19:56:39.7489206Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.7489891Z 2025-05-07T19:56:39.7491625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7494356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7495597Z ^ 2025-05-07T19:56:39.7495977Z 2025-05-07T19:56:39.7497961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7500157Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.7501093Z ^ 2025-05-07T19:56:39.7501386Z 2025-05-07T19:56:39.7502992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7505018Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7505598Z ^ 2025-05-07T19:56:39.7508416Z 2025-05-07T19:56:39.7510002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7511963Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7512502Z ^ 2025-05-07T19:56:39.7512793Z 2025-05-07T19:56:39.7514336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7516337Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7516876Z ^ 2025-05-07T19:56:39.7517167Z 2025-05-07T19:56:39.7518845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7521593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7522784Z ^ 2025-05-07T19:56:39.7523035Z 2025-05-07T19:56:39.7523517Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:39.7524192Z 2025-05-07T19:56:39.7526078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:39.7528839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:39.7530385Z ^ 2025-05-07T19:56:39.7530750Z 2025-05-07T19:56:39.7532344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7534517Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:39.7535252Z ^ 2025-05-07T19:56:39.7535518Z 2025-05-07T19:56:39.7537041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7538845Z if (output_d >= 0 
&& output_d < D) { 2025-05-07T19:56:39.7539578Z ^ 2025-05-07T19:56:39.7539851Z 2025-05-07T19:56:39.7541434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7543293Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7543793Z ^ 2025-05-07T19:56:39.7544034Z 2025-05-07T19:56:39.7545571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:39.7547219Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:39.7547817Z ^ 2025-05-07T19:56:39.7548115Z 2025-05-07T19:56:49.0801208Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:49.0823858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0826522Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:56:49.0827702Z ^ 2025-05-07T19:56:49.0827953Z 2025-05-07T19:56:49.0828380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.0829050Z 2025-05-07T19:56:49.0830607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0833715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0834914Z ^ 2025-05-07T19:56:49.0835280Z 2025-05-07T19:56:49.0836728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0838750Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:49.0839433Z ^ 2025-05-07T19:56:49.0839717Z 2025-05-07T19:56:49.0841127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0842956Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0843476Z ^ 2025-05-07T19:56:49.0843740Z 2025-05-07T19:56:49.0845124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0846793Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0847299Z ^ 2025-05-07T19:56:49.0847549Z 2025-05-07T19:56:49.0848957Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0850687Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0851219Z ^ 2025-05-07T19:56:49.0851478Z 2025-05-07T19:56:49.0853000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0855464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0856755Z ^ 2025-05-07T19:56:49.0856997Z 2025-05-07T19:56:49.0857411Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.0858042Z 2025-05-07T19:56:49.0859508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0862073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0863159Z ^ 2025-05-07T19:56:49.0863496Z 2025-05-07T19:56:49.0865027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0866946Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:49.0867686Z ^ 2025-05-07T19:56:49.0868082Z 2025-05-07T19:56:49.0869515Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0871313Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0871851Z ^ 2025-05-07T19:56:49.0872142Z 2025-05-07T19:56:49.0873823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0875707Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0876535Z ^ 2025-05-07T19:56:49.0876804Z 2025-05-07T19:56:49.0878293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0880291Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0880823Z ^ 2025-05-07T19:56:49.0881070Z 2025-05-07T19:56:49.0882547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0885429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0886539Z ^ 2025-05-07T19:56:49.0886806Z 2025-05-07T19:56:49.0887230Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.0887865Z 2025-05-07T19:56:49.0889427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0891965Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0893111Z ^ 2025-05-07T19:56:49.0893476Z 2025-05-07T19:56:49.0894950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0896983Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:49.0897725Z ^ 2025-05-07T19:56:49.0898009Z 2025-05-07T19:56:49.0899463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0901740Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0902206Z ^ 2025-05-07T19:56:49.0902442Z 2025-05-07T19:56:49.0903720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0905369Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0905878Z ^ 2025-05-07T19:56:49.0906169Z 2025-05-07T19:56:49.0907533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0909305Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0909758Z ^ 2025-05-07T19:56:49.0910005Z 2025-05-07T19:56:49.0911487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:56:49.0913983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0915116Z ^ 2025-05-07T19:56:49.0915361Z 2025-05-07T19:56:49.0915812Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.0916743Z 2025-05-07T19:56:49.0918005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0920332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0921481Z ^ 2025-05-07T19:56:49.0921822Z 2025-05-07T19:56:49.0923262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0925305Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:49.0926021Z ^ 2025-05-07T19:56:49.0926281Z 2025-05-07T19:56:49.0927737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0929629Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0930149Z ^ 2025-05-07T19:56:49.0930431Z 2025-05-07T19:56:49.0931916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0933746Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0934238Z ^ 
2025-05-07T19:56:49.0934484Z 2025-05-07T19:56:49.0935968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0937877Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0938438Z ^ 2025-05-07T19:56:49.0938712Z 2025-05-07T19:56:49.0940293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0942843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0944019Z ^ 2025-05-07T19:56:49.0944267Z 2025-05-07T19:56:49.0944717Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:49.0945353Z 2025-05-07T19:56:49.0947036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.0949788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:49.0950984Z ^ 2025-05-07T19:56:49.0951364Z 2025-05-07T19:56:49.0952785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0954850Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:49.0955614Z ^ 2025-05-07T19:56:49.0955897Z 2025-05-07T19:56:49.0957565Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0959365Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0959897Z ^ 2025-05-07T19:56:49.0960166Z 2025-05-07T19:56:49.0961646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0963454Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0963961Z ^ 2025-05-07T19:56:49.0964234Z 2025-05-07T19:56:49.0965537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:49.0967385Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:49.0967921Z ^ 2025-05-07T19:56:49.0968174Z 2025-05-07T19:56:50.1991682Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:56:50.2006468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2008067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2008718Z ^ 2025-05-07T19:56:50.2008865Z 2025-05-07T19:56:50.2009112Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2009476Z 2025-05-07T19:56:50.2010371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2011780Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2012428Z ^ 2025-05-07T19:56:50.2012689Z 2025-05-07T19:56:50.2013513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2014630Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:50.2015063Z ^ 2025-05-07T19:56:50.2015224Z 2025-05-07T19:56:50.2016044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2017109Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2017433Z ^ 2025-05-07T19:56:50.2017595Z 2025-05-07T19:56:50.2018406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:50.2019417Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2019731Z ^ 2025-05-07T19:56:50.2019891Z 2025-05-07T19:56:50.2020834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2021926Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2022241Z ^ 2025-05-07T19:56:50.2022397Z 2025-05-07T19:56:50.2023276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2024700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2025344Z ^ 2025-05-07T19:56:50.2025490Z 2025-05-07T19:56:50.2025739Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2026112Z 2025-05-07T19:56:50.2026990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2028405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2029095Z ^ 2025-05-07T19:56:50.2029365Z 2025-05-07T19:56:50.2030219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2031333Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:50.2031761Z ^ 
2025-05-07T19:56:50.2031916Z 2025-05-07T19:56:50.2032836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2033851Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2034174Z ^ 2025-05-07T19:56:50.2034331Z 2025-05-07T19:56:50.2035125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2036149Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2036502Z ^ 2025-05-07T19:56:50.2036654Z 2025-05-07T19:56:50.2037451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2038471Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2038771Z ^ 2025-05-07T19:56:50.2038936Z 2025-05-07T19:56:50.2039814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2041230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2041853Z ^ 2025-05-07T19:56:50.2042009Z 2025-05-07T19:56:50.2042255Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2042612Z 2025-05-07T19:56:50.2043512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:56:50.2044918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2045612Z ^ 2025-05-07T19:56:50.2045813Z 2025-05-07T19:56:50.2046615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2047742Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:50.2048164Z ^ 2025-05-07T19:56:50.2048322Z 2025-05-07T19:56:50.2049125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2050150Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2050467Z ^ 2025-05-07T19:56:50.2050622Z 2025-05-07T19:56:50.2051412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2052433Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2052734Z ^ 2025-05-07T19:56:50.2052901Z 2025-05-07T19:56:50.2053699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2054722Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2055020Z ^ 2025-05-07T19:56:50.2055173Z 2025-05-07T19:56:50.2056129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2057537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2058174Z ^ 2025-05-07T19:56:50.2058316Z 2025-05-07T19:56:50.2058573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2058933Z 2025-05-07T19:56:50.2059814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2061425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2062074Z ^ 2025-05-07T19:56:50.2062273Z 2025-05-07T19:56:50.2063073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2064202Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:50.2064611Z ^ 2025-05-07T19:56:50.2064783Z 2025-05-07T19:56:50.2065578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2066605Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2066916Z ^ 2025-05-07T19:56:50.2067085Z 2025-05-07T19:56:50.2067884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2068966Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:56:50.2069268Z ^ 2025-05-07T19:56:50.2069424Z 2025-05-07T19:56:50.2070240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2071250Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2071571Z ^ 2025-05-07T19:56:50.2071726Z 2025-05-07T19:56:50.2072620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2074024Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2074678Z ^ 2025-05-07T19:56:50.2074821Z 2025-05-07T19:56:50.2075066Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:50.2075439Z 2025-05-07T19:56:50.2076577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.2078006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:50.2078648Z ^ 2025-05-07T19:56:50.2078868Z 2025-05-07T19:56:50.2080012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2081145Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:50.2081566Z ^ 2025-05-07T19:56:50.2081741Z 2025-05-07T19:56:50.2082541Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2083556Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2083879Z ^ 2025-05-07T19:56:50.2084089Z 2025-05-07T19:56:50.2084907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2085924Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2086240Z ^ 2025-05-07T19:56:50.2086393Z 2025-05-07T19:56:50.2087183Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:50.2088205Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:50.2088517Z ^ 2025-05-07T19:56:50.2088667Z 2025-05-07T19:56:53.4171810Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:56:53.4194971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation 
is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4197542Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4198708Z ^ 2025-05-07T19:56:53.4198965Z 2025-05-07T19:56:53.4199391Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.4200232Z 2025-05-07T19:56:53.4201845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4204392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4205601Z ^ 2025-05-07T19:56:53.4205983Z 2025-05-07T19:56:53.4207588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4209605Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:53.4210315Z ^ 2025-05-07T19:56:53.4210582Z 2025-05-07T19:56:53.4212227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4214085Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4214634Z ^ 2025-05-07T19:56:53.4214922Z 2025-05-07T19:56:53.4216481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 
2025-05-07T19:56:53.4218440Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4218952Z ^ 2025-05-07T19:56:53.4219227Z 2025-05-07T19:56:53.4220907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4222646Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4223149Z ^ 2025-05-07T19:56:53.4223420Z 2025-05-07T19:56:53.4224920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4227386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4228573Z ^ 2025-05-07T19:56:53.4228812Z 2025-05-07T19:56:53.4229274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.4229966Z 2025-05-07T19:56:53.4231566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4234313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4235602Z ^ 2025-05-07T19:56:53.4235940Z 2025-05-07T19:56:53.4237356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4239358Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:53.4240094Z ^ 
2025-05-07T19:56:53.4240359Z 2025-05-07T19:56:53.4241900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4244166Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4244700Z ^ 2025-05-07T19:56:53.4244988Z 2025-05-07T19:56:53.4246464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4248316Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4248841Z ^ 2025-05-07T19:56:53.4249108Z 2025-05-07T19:56:53.4250817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4252821Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4253362Z ^ 2025-05-07T19:56:53.4253628Z 2025-05-07T19:56:53.4255230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4257762Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4258965Z ^ 2025-05-07T19:56:53.4259206Z 2025-05-07T19:56:53.4259659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.4260293Z 2025-05-07T19:56:53.4262169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:56:53.4264779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4266011Z ^ 2025-05-07T19:56:53.4266418Z 2025-05-07T19:56:53.4267941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4269989Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:53.4270708Z ^ 2025-05-07T19:56:53.4271009Z 2025-05-07T19:56:53.4272487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4274360Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4274883Z ^ 2025-05-07T19:56:53.4275159Z 2025-05-07T19:56:53.4277388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4279500Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4280066Z ^ 2025-05-07T19:56:53.4280356Z 2025-05-07T19:56:53.4281999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4283963Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4284502Z ^ 2025-05-07T19:56:53.4284763Z 2025-05-07T19:56:53.4286337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4289087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4290316Z ^ 2025-05-07T19:56:53.4290572Z 2025-05-07T19:56:53.4291034Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.4291712Z 2025-05-07T19:56:53.4293324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4295891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4297020Z ^ 2025-05-07T19:56:53.4297365Z 2025-05-07T19:56:53.4298865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4301197Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:53.4301922Z ^ 2025-05-07T19:56:53.4302209Z 2025-05-07T19:56:53.4303708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4305858Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4306381Z ^ 2025-05-07T19:56:53.4306654Z 2025-05-07T19:56:53.4308143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4309981Z if (output_d >= 0 && output_d < D) { 
2025-05-07T19:56:53.4310530Z ^ 2025-05-07T19:56:53.4310788Z 2025-05-07T19:56:53.4312259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4314148Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4314686Z ^ 2025-05-07T19:56:53.4314968Z 2025-05-07T19:56:53.4316659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4319642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4320785Z ^ 2025-05-07T19:56:53.4321035Z 2025-05-07T19:56:53.4321461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:53.4322261Z 2025-05-07T19:56:53.4323894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.4326753Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:53.4328054Z ^ 2025-05-07T19:56:53.4328410Z 2025-05-07T19:56:53.4329957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4332272Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T19:56:53.4333038Z ^ 2025-05-07T19:56:53.4333320Z 2025-05-07T19:56:53.4334859Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4336779Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4337338Z ^ 2025-05-07T19:56:53.4337618Z 2025-05-07T19:56:53.4339307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4341454Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4342055Z ^ 2025-05-07T19:56:53.4342353Z 2025-05-07T19:56:53.4343931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:53.4345838Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:56:53.4346351Z ^ 2025-05-07T19:56:53.4346651Z 2025-05-07T19:56:54.5151228Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:54.5175378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5178489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5179699Z ^ 2025-05-07T19:56:54.5179973Z 2025-05-07T19:56:54.5180525Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:54.5181242Z 2025-05-07T19:56:54.5182974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5185769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5187005Z ^ 2025-05-07T19:56:54.5187397Z 2025-05-07T19:56:54.5189116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5191882Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5193254Z ^ 2025-05-07T19:56:54.5193512Z 2025-05-07T19:56:54.5193971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:54.5194644Z 2025-05-07T19:56:54.5195928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5198592Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5199817Z ^ 
2025-05-07T19:56:54.5200190Z 2025-05-07T19:56:54.5201881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5204614Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5205830Z ^ 2025-05-07T19:56:54.5206090Z 2025-05-07T19:56:54.5206553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:54.5207246Z 2025-05-07T19:56:54.5208938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5211748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5212972Z ^ 2025-05-07T19:56:54.5213348Z 2025-05-07T19:56:54.5215096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5217853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5219073Z ^ 2025-05-07T19:56:54.5219333Z 2025-05-07T19:56:54.5219933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:54.5220763Z 2025-05-07T19:56:54.5222585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:56:54.5225473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5226735Z ^ 2025-05-07T19:56:54.5227117Z 2025-05-07T19:56:54.5228875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5231701Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5232843Z ^ 2025-05-07T19:56:54.5233117Z 2025-05-07T19:56:54.5233578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:54.5234271Z 2025-05-07T19:56:54.5236057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5238919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:54.5240174Z ^ 2025-05-07T19:56:54.5240560Z 2025-05-07T19:56:56.5412519Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:56.5434453Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5437214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5438281Z ^ 2025-05-07T19:56:56.5438530Z 2025-05-07T19:56:56.5438946Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.5439704Z 2025-05-07T19:56:56.5441347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5444046Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5445193Z ^ 2025-05-07T19:56:56.5445547Z 2025-05-07T19:56:56.5447537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5450176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5451378Z ^ 2025-05-07T19:56:56.5451635Z 2025-05-07T19:56:56.5452102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.5452771Z 2025-05-07T19:56:56.5454485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5457045Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5458221Z ^ 2025-05-07T19:56:56.5458587Z 2025-05-07T19:56:56.5460257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5462861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5463996Z ^ 2025-05-07T19:56:56.5464234Z 2025-05-07T19:56:56.5464627Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.5465271Z 2025-05-07T19:56:56.5467082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5469445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5470515Z ^ 2025-05-07T19:56:56.5470847Z 2025-05-07T19:56:56.5472329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5475114Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5476608Z ^ 2025-05-07T19:56:56.5476856Z 2025-05-07T19:56:56.5477260Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.5477828Z 2025-05-07T19:56:56.5479305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5481927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5483111Z ^ 2025-05-07T19:56:56.5483507Z 2025-05-07T19:56:56.5485152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5487920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5489004Z ^ 2025-05-07T19:56:56.5489218Z 2025-05-07T19:56:56.5489620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:56.5490436Z 2025-05-07T19:56:56.5492019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.5494472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:56.5495587Z ^ 2025-05-07T19:56:56.5495911Z 2025-05-07T19:56:57.9672632Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:57.9696692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9699178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9700239Z ^ 2025-05-07T19:56:57.9700626Z 2025-05-07T19:56:57.9701059Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:57.9701669Z 2025-05-07T19:56:57.9703428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9706467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9707658Z ^ 2025-05-07T19:56:57.9708001Z 2025-05-07T19:56:57.9709614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9712324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9713510Z ^ 2025-05-07T19:56:57.9713803Z 2025-05-07T19:56:57.9714254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:57.9714944Z 2025-05-07T19:56:57.9716338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9718654Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9719781Z ^ 2025-05-07T19:56:57.9720143Z 2025-05-07T19:56:57.9721708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9724625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9725784Z ^ 2025-05-07T19:56:57.9726033Z 2025-05-07T19:56:57.9726446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:57.9727036Z 2025-05-07T19:56:57.9728508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9730972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9732207Z ^ 2025-05-07T19:56:57.9732682Z 2025-05-07T19:56:57.9734417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9736970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9738150Z ^ 2025-05-07T19:56:57.9738424Z 2025-05-07T19:56:57.9738855Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:56:57.9739508Z 2025-05-07T19:56:57.9741319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9743807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9744927Z ^ 2025-05-07T19:56:57.9745369Z 2025-05-07T19:56:57.9747098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9749673Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9750855Z ^ 2025-05-07T19:56:57.9751119Z 2025-05-07T19:56:57.9751555Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:56:57.9752237Z 2025-05-07T19:56:57.9753790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.9756484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:56:57.9757686Z ^ 2025-05-07T19:56:57.9758091Z 2025-05-07T19:57:02.9074577Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:02.9095915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9098408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9099554Z ^ 2025-05-07T19:57:02.9099820Z 2025-05-07T19:57:02.9100276Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9101332Z 2025-05-07T19:57:02.9102760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9105207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9106351Z ^ 2025-05-07T19:57:02.9106714Z 2025-05-07T19:57:02.9108336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9110837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9112032Z ^ 2025-05-07T19:57:02.9112281Z 2025-05-07T19:57:02.9112696Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:57:02.9113326Z 2025-05-07T19:57:02.9114896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9117344Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9118545Z ^ 2025-05-07T19:57:02.9118932Z 2025-05-07T19:57:02.9120931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9123338Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9124399Z ^ 2025-05-07T19:57:02.9124639Z 2025-05-07T19:57:02.9125037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9125580Z 2025-05-07T19:57:02.9126857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9129663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9130744Z ^ 2025-05-07T19:57:02.9131114Z 2025-05-07T19:57:02.9132591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9134960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9135990Z ^ 2025-05-07T19:57:02.9136256Z 2025-05-07T19:57:02.9136659Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9137212Z 2025-05-07T19:57:02.9138734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9141379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9142486Z ^ 2025-05-07T19:57:02.9142980Z 2025-05-07T19:57:02.9144493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9146934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9148222Z ^ 2025-05-07T19:57:02.9148494Z 2025-05-07T19:57:02.9148896Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:02.9149517Z 2025-05-07T19:57:02.9151062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:02.9153513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:02.9154634Z ^ 2025-05-07T19:57:02.9154945Z 2025-05-07T19:57:03.6303844Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:03.6327114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6331384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6332616Z ^ 2025-05-07T19:57:03.6332881Z 2025-05-07T19:57:03.6333345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6334056Z 2025-05-07T19:57:03.6335768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6338301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6339474Z ^ 2025-05-07T19:57:03.6339860Z 2025-05-07T19:57:03.6341718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6344417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:57:03.6345544Z ^ 2025-05-07T19:57:03.6345790Z 2025-05-07T19:57:03.6346239Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6346867Z 2025-05-07T19:57:03.6348459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6351307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6352405Z ^ 2025-05-07T19:57:03.6352772Z 2025-05-07T19:57:03.6354297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6356873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6357993Z ^ 2025-05-07T19:57:03.6358372Z 2025-05-07T19:57:03.6358803Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6359438Z 2025-05-07T19:57:03.6361137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6363817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6365024Z ^ 2025-05-07T19:57:03.6365392Z 2025-05-07T19:57:03.6367186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:57:03.6369843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6371038Z ^ 2025-05-07T19:57:03.6371293Z 2025-05-07T19:57:03.6371785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6372472Z 2025-05-07T19:57:03.6374153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6377403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6378613Z ^ 2025-05-07T19:57:03.6379014Z 2025-05-07T19:57:03.6380741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6383508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6384735Z ^ 2025-05-07T19:57:03.6385009Z 2025-05-07T19:57:03.6385496Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:03.6386205Z 2025-05-07T19:57:03.6387772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.6390525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:03.6391700Z ^ 2025-05-07T19:57:03.6392118Z 2025-05-07T19:57:06.6198869Z [257/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:57:06.6220762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6223229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6224328Z ^ 2025-05-07T19:57:06.6224614Z 2025-05-07T19:57:06.6225033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6225678Z 2025-05-07T19:57:06.6227233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6229738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6230841Z ^ 2025-05-07T19:57:06.6231204Z 2025-05-07T19:57:06.6232724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6235241Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6236316Z ^ 2025-05-07T19:57:06.6236591Z 2025-05-07T19:57:06.6237015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6237893Z 2025-05-07T19:57:06.6239470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6241918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6243034Z ^ 2025-05-07T19:57:06.6243398Z 2025-05-07T19:57:06.6244913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6247512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6248632Z ^ 2025-05-07T19:57:06.6248867Z 2025-05-07T19:57:06.6249291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6249923Z 2025-05-07T19:57:06.6251447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6253963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6255072Z ^ 2025-05-07T19:57:06.6255439Z 2025-05-07T19:57:06.6256965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6259495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6260745Z ^ 2025-05-07T19:57:06.6260987Z 2025-05-07T19:57:06.6261443Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6262189Z 2025-05-07T19:57:06.6263704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6266184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6267355Z ^ 2025-05-07T19:57:06.6267671Z 2025-05-07T19:57:06.6269172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6271638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:06.6272719Z ^ 2025-05-07T19:57:06.6272954Z 2025-05-07T19:57:06.6273370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:06.6273983Z 2025-05-07T19:57:06.6275489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:06.6278334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:57:06.6279439Z ^ 2025-05-07T19:57:06.6279785Z 2025-05-07T19:57:14.2502732Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:14.2525845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2528568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2529693Z ^ 2025-05-07T19:57:14.2529935Z 2025-05-07T19:57:14.2530357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2530967Z 2025-05-07T19:57:14.2532496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2535172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2536293Z ^ 2025-05-07T19:57:14.2536682Z 2025-05-07T19:57:14.2538271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:57:14.2541395Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2542554Z ^ 2025-05-07T19:57:14.2542829Z 2025-05-07T19:57:14.2543276Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2543955Z 2025-05-07T19:57:14.2545598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2548109Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2549308Z ^ 2025-05-07T19:57:14.2549654Z 2025-05-07T19:57:14.2551236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2553857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2555026Z ^ 2025-05-07T19:57:14.2555256Z 2025-05-07T19:57:14.2555693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2556361Z 2025-05-07T19:57:14.2558009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2560721Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2561939Z ^ 2025-05-07T19:57:14.2562331Z 2025-05-07T19:57:14.2563930Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2566612Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2567640Z ^ 2025-05-07T19:57:14.2567906Z 2025-05-07T19:57:14.2568313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2568911Z 2025-05-07T19:57:14.2570565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2573264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2574440Z ^ 2025-05-07T19:57:14.2574784Z 2025-05-07T19:57:14.2576953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2579655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2580937Z ^ 2025-05-07T19:57:14.2581194Z 2025-05-07T19:57:14.2581633Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:14.2582330Z 2025-05-07T19:57:14.2584448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.2587080Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:14.2588208Z ^ 2025-05-07T19:57:14.2588555Z 2025-05-07T19:57:19.5091032Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:57:19.5118131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5120834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5122039Z ^ 2025-05-07T19:57:19.5122336Z 2025-05-07T19:57:19.5122785Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.5123455Z 2025-05-07T19:57:19.5125156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5127692Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5128799Z ^ 2025-05-07T19:57:19.5129445Z 2025-05-07T19:57:19.5131139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5133904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5135128Z ^ 2025-05-07T19:57:19.5135397Z 2025-05-07T19:57:19.5135852Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.5136568Z 2025-05-07T19:57:19.5138185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5141226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5142390Z ^ 2025-05-07T19:57:19.5142794Z 2025-05-07T19:57:19.5144479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5147243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5148458Z ^ 2025-05-07T19:57:19.5148720Z 2025-05-07T19:57:19.5149201Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.5149881Z 2025-05-07T19:57:19.5151516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5154189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5155494Z ^ 2025-05-07T19:57:19.5156012Z 2025-05-07T19:57:19.5157645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5160251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5161423Z ^ 2025-05-07T19:57:19.5161689Z 2025-05-07T19:57:19.5162124Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:19.5162831Z 2025-05-07T19:57:19.5164537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5167121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5168235Z ^ 2025-05-07T19:57:19.5168599Z 2025-05-07T19:57:19.5170209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5172908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5174083Z ^ 2025-05-07T19:57:19.5174334Z 2025-05-07T19:57:19.5175001Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:19.5175734Z 2025-05-07T19:57:19.5177637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.5180361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:19.5181644Z ^ 2025-05-07T19:57:19.5182024Z 2025-05-07T19:57:20.7616789Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:20.7640572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7643258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7644438Z ^ 2025-05-07T19:57:20.7644814Z 2025-05-07T19:57:20.7645259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:20.7645937Z 2025-05-07T19:57:20.7647871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7650559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7651738Z ^ 2025-05-07T19:57:20.7652173Z 2025-05-07T19:57:20.7653680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7656454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7657704Z ^ 2025-05-07T19:57:20.7657996Z 2025-05-07T19:57:20.7658437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:20.7659112Z 2025-05-07T19:57:20.7660701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7663136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7664338Z ^ 2025-05-07T19:57:20.7664710Z 2025-05-07T19:57:20.7666367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7669073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7670276Z ^ 2025-05-07T19:57:20.7670536Z 2025-05-07T19:57:20.7670989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:20.7671618Z 2025-05-07T19:57:20.7673282Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7676307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7677458Z ^ 2025-05-07T19:57:20.7677857Z 2025-05-07T19:57:20.7679503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7682227Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7683401Z ^ 2025-05-07T19:57:20.7683672Z 2025-05-07T19:57:20.7684166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:20.7684851Z 2025-05-07T19:57:20.7686494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7688999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7690159Z ^ 2025-05-07T19:57:20.7690511Z 2025-05-07T19:57:20.7692365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7695013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7696353Z ^ 
2025-05-07T19:57:20.7696585Z 2025-05-07T19:57:20.7697052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:20.7697718Z 2025-05-07T19:57:20.7699413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.7702332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:20.7703720Z ^ 2025-05-07T19:57:20.7704112Z 2025-05-07T19:57:23.2616483Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:23.2641011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2643863Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2644972Z ^ 2025-05-07T19:57:23.2645217Z 2025-05-07T19:57:23.2646049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:23.2646927Z 2025-05-07T19:57:23.2648901Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2651558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2652710Z ^ 2025-05-07T19:57:23.2653106Z 2025-05-07T19:57:23.2654704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2657494Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2658661Z ^ 2025-05-07T19:57:23.2658928Z 2025-05-07T19:57:23.2659368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:23.2660009Z 2025-05-07T19:57:23.2661778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2664332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2665517Z ^ 2025-05-07T19:57:23.2665882Z 2025-05-07T19:57:23.2667518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2670166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2671656Z ^ 
2025-05-07T19:57:23.2671967Z 2025-05-07T19:57:23.2672433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:23.2673122Z 2025-05-07T19:57:23.2675014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2677969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2679191Z ^ 2025-05-07T19:57:23.2679563Z 2025-05-07T19:57:23.2681221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2684022Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2685215Z ^ 2025-05-07T19:57:23.2685485Z 2025-05-07T19:57:23.2685947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:23.2686632Z 2025-05-07T19:57:23.2688320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2691058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2692530Z ^ 2025-05-07T19:57:23.2692947Z 2025-05-07T19:57:23.2694733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:23.2697630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2698838Z ^ 2025-05-07T19:57:23.2699136Z 2025-05-07T19:57:23.2699611Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:23.2700272Z 2025-05-07T19:57:23.2702112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:23.2704906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:23.2706166Z ^ 2025-05-07T19:57:23.2706563Z 2025-05-07T19:57:25.7521168Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:57:25.7545308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7548036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7549242Z ^ 
2025-05-07T19:57:25.7549507Z 2025-05-07T19:57:25.7549955Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:25.7550639Z 2025-05-07T19:57:25.7552166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7554891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7556093Z ^ 2025-05-07T19:57:25.7556461Z 2025-05-07T19:57:25.7558037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7560508Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7561648Z ^ 2025-05-07T19:57:25.7561877Z 2025-05-07T19:57:25.7562348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:25.7563003Z 2025-05-07T19:57:25.7564775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7567417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7568624Z ^ 2025-05-07T19:57:25.7568999Z 2025-05-07T19:57:25.7570634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:25.7573475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7574621Z ^ 2025-05-07T19:57:25.7574880Z 2025-05-07T19:57:25.7575313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:25.7576317Z 2025-05-07T19:57:25.7577950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7580796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7581897Z ^ 2025-05-07T19:57:25.7582301Z 2025-05-07T19:57:25.7583754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7586292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7587464Z ^ 2025-05-07T19:57:25.7587694Z 2025-05-07T19:57:25.7588171Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:25.7588811Z 2025-05-07T19:57:25.7590804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7593565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7594816Z ^ 2025-05-07T19:57:25.7595173Z 2025-05-07T19:57:25.7596783Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7599440Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7600826Z ^ 2025-05-07T19:57:25.7601070Z 2025-05-07T19:57:25.7601524Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:25.7602196Z 2025-05-07T19:57:25.7603876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.7606600Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:25.7607774Z ^ 2025-05-07T19:57:25.7608158Z 2025-05-07T19:57:29.0574118Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:29.0597569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:57:29.0600298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0601471Z ^ 2025-05-07T19:57:29.0601738Z 2025-05-07T19:57:29.0602196Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.0603251Z 2025-05-07T19:57:29.0604991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0607690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0608979Z ^ 2025-05-07T19:57:29.0609373Z 2025-05-07T19:57:29.0611079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0613774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0614937Z ^ 2025-05-07T19:57:29.0615222Z 2025-05-07T19:57:29.0615653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.0616326Z 2025-05-07T19:57:29.0618025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0621232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0622507Z ^ 2025-05-07T19:57:29.0623005Z 2025-05-07T19:57:29.0624706Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0627416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0628623Z ^ 2025-05-07T19:57:29.0628877Z 2025-05-07T19:57:29.0629341Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.0630031Z 2025-05-07T19:57:29.0631533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0634064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0635250Z ^ 2025-05-07T19:57:29.0635621Z 2025-05-07T19:57:29.0637229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0640194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0641371Z ^ 2025-05-07T19:57:29.0641639Z 2025-05-07T19:57:29.0642115Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.0642784Z 2025-05-07T19:57:29.0644470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0647185Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0648389Z ^ 2025-05-07T19:57:29.0648874Z 2025-05-07T19:57:29.0650555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0653214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0654446Z ^ 2025-05-07T19:57:29.0654704Z 2025-05-07T19:57:29.0655137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:29.0655815Z 2025-05-07T19:57:29.0657460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:29.0660056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:29.0673094Z ^ 2025-05-07T19:57:29.0673606Z 2025-05-07T19:57:30.7965352Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:57:30.7990679Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.7993845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.7994988Z ^ 2025-05-07T19:57:30.7995279Z 2025-05-07T19:57:30.7995674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.7996383Z 2025-05-07T19:57:30.7998097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8000883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8002099Z ^ 2025-05-07T19:57:30.8002506Z 2025-05-07T19:57:30.8004106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8006484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8007563Z ^ 2025-05-07T19:57:30.8007848Z 2025-05-07T19:57:30.8008294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.8009094Z 2025-05-07T19:57:30.8010723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8013482Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8014739Z ^ 2025-05-07T19:57:30.8015120Z 2025-05-07T19:57:30.8016786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8019770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8021166Z ^ 2025-05-07T19:57:30.8021564Z 2025-05-07T19:57:30.8022015Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.8022703Z 2025-05-07T19:57:30.8024448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8027184Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8028283Z ^ 2025-05-07T19:57:30.8028691Z 2025-05-07T19:57:30.8030594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8033308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8034416Z ^ 2025-05-07T19:57:30.8034676Z 2025-05-07T19:57:30.8035152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.8035739Z 2025-05-07T19:57:30.8037398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8040279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8041515Z ^ 2025-05-07T19:57:30.8041897Z 2025-05-07T19:57:30.8043575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8046311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8047538Z ^ 2025-05-07T19:57:30.8047763Z 2025-05-07T19:57:30.8048201Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:30.8048874Z 2025-05-07T19:57:30.8050552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.8053244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:30.8054422Z ^ 2025-05-07T19:57:30.8055010Z 2025-05-07T19:57:46.6219824Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:46.6243541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6245971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6247115Z ^ 2025-05-07T19:57:46.6247332Z 2025-05-07T19:57:46.6247688Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.6248248Z 2025-05-07T19:57:46.6249614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6252193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6253106Z ^ 2025-05-07T19:57:46.6253446Z 2025-05-07T19:57:46.6254917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6257503Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6258592Z ^ 2025-05-07T19:57:46.6258846Z 2025-05-07T19:57:46.6259271Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.6259860Z 2025-05-07T19:57:46.6261455Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6264044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6265247Z ^ 2025-05-07T19:57:46.6265620Z 2025-05-07T19:57:46.6267271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6269917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6271040Z ^ 2025-05-07T19:57:46.6271297Z 2025-05-07T19:57:46.6271764Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.6272442Z 2025-05-07T19:57:46.6274244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6277275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6278323Z ^ 2025-05-07T19:57:46.6278683Z 2025-05-07T19:57:46.6280168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6282736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6284016Z ^ 
2025-05-07T19:57:46.6284246Z 2025-05-07T19:57:46.6284683Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.6285342Z 2025-05-07T19:57:46.6286915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6289521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6290655Z ^ 2025-05-07T19:57:46.6290993Z 2025-05-07T19:57:46.6292543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6295407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6296723Z ^ 2025-05-07T19:57:46.6296976Z 2025-05-07T19:57:46.6297432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:46.6298121Z 2025-05-07T19:57:46.6299793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.6302655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:46.6303765Z ^ 2025-05-07T19:57:46.6304134Z 2025-05-07T19:57:49.3798929Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:49.3820167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3822702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3823655Z ^ 2025-05-07T19:57:49.3823903Z 2025-05-07T19:57:49.3824338Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.3824872Z 2025-05-07T19:57:49.3826335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3828912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3830276Z ^ 2025-05-07T19:57:49.3830606Z 2025-05-07T19:57:49.3832189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3834727Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:57:49.3835645Z ^ 2025-05-07T19:57:49.3835853Z 2025-05-07T19:57:49.3836211Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.3836736Z 2025-05-07T19:57:49.3838079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3840209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3841183Z ^ 2025-05-07T19:57:49.3841471Z 2025-05-07T19:57:49.3842904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3845391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3846528Z ^ 2025-05-07T19:57:49.3846779Z 2025-05-07T19:57:49.3847467Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.3848041Z 2025-05-07T19:57:49.3849639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3852350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3853543Z ^ 2025-05-07T19:57:49.3853930Z 2025-05-07T19:57:49.3855608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:57:49.3858336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3859490Z ^ 2025-05-07T19:57:49.3859761Z 2025-05-07T19:57:49.3860198Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.3861014Z 2025-05-07T19:57:49.3862685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3865323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3866498Z ^ 2025-05-07T19:57:49.3866858Z 2025-05-07T19:57:49.3868504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3871119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3872262Z ^ 2025-05-07T19:57:49.3872631Z 2025-05-07T19:57:49.3873068Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:49.3873732Z 2025-05-07T19:57:49.3875371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.3878282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:49.3879440Z ^ 2025-05-07T19:57:49.3879811Z 2025-05-07T19:57:51.1943827Z [267/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:51.1967248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1969996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.1971089Z ^ 2025-05-07T19:57:51.1971320Z 2025-05-07T19:57:51.1971731Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.1972324Z 2025-05-07T19:57:51.1973766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1976741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.1977763Z ^ 2025-05-07T19:57:51.1978109Z 2025-05-07T19:57:51.1979509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:57:51.1981922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.1982916Z ^ 2025-05-07T19:57:51.1983161Z 2025-05-07T19:57:51.1983546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.1984111Z 2025-05-07T19:57:51.1985531Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1987713Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.1988807Z ^ 2025-05-07T19:57:51.1989175Z 2025-05-07T19:57:51.1991046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.1993690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.1994874Z ^ 2025-05-07T19:57:51.1995119Z 2025-05-07T19:57:51.1995573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.1996244Z 2025-05-07T19:57:51.1997882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.2000699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.2002074Z ^ 2025-05-07T19:57:51.2002450Z 2025-05-07T19:57:51.2004066Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.2006692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.2007838Z ^ 2025-05-07T19:57:51.2008098Z 2025-05-07T19:57:51.2008550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.2009217Z 2025-05-07T19:57:51.2010652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.2013035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.2014177Z ^ 2025-05-07T19:57:51.2014516Z 2025-05-07T19:57:51.2016077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.2018655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.2019619Z ^ 2025-05-07T19:57:51.2019827Z 2025-05-07T19:57:51.2020204Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:51.2020860Z 2025-05-07T19:57:51.2022238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.2024603Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:51.2025723Z ^ 2025-05-07T19:57:51.2026035Z 2025-05-07T19:57:53.8349109Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:53.8372347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8375055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8376855Z ^ 2025-05-07T19:57:53.8377104Z 2025-05-07T19:57:53.8377540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.8378197Z 2025-05-07T19:57:53.8379753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8382235Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8383359Z ^ 2025-05-07T19:57:53.8383736Z 2025-05-07T19:57:53.8385312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8387397Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8388496Z ^ 2025-05-07T19:57:53.8388733Z 2025-05-07T19:57:53.8389166Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.8389787Z 2025-05-07T19:57:53.8391640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8394179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8395737Z ^ 2025-05-07T19:57:53.8396112Z 2025-05-07T19:57:53.8397738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8400425Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8401597Z ^ 2025-05-07T19:57:53.8401851Z 2025-05-07T19:57:53.8402302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.8402974Z 2025-05-07T19:57:53.8404836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8407549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8408740Z ^ 2025-05-07T19:57:53.8409122Z 2025-05-07T19:57:53.8410636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8413238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8414419Z ^ 2025-05-07T19:57:53.8414669Z 2025-05-07T19:57:53.8415117Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:53.8415787Z 2025-05-07T19:57:53.8417408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8419825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8421257Z ^ 2025-05-07T19:57:53.8421603Z 2025-05-07T19:57:53.8423010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8425486Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8426477Z ^ 2025-05-07T19:57:53.8426705Z 2025-05-07T19:57:53.8427095Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:53.8427750Z 2025-05-07T19:57:53.8429299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:53.8431971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:53.8433097Z ^ 2025-05-07T19:57:53.8433449Z 2025-05-07T19:57:55.1613370Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:55.1633899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1636566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1637638Z ^ 2025-05-07T19:57:55.1637872Z 2025-05-07T19:57:55.1638282Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.1638910Z 2025-05-07T19:57:55.1640439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:57:55.1642789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1643851Z ^ 2025-05-07T19:57:55.1644185Z 2025-05-07T19:57:55.1645684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1648051Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1649105Z ^ 2025-05-07T19:57:55.1649345Z 2025-05-07T19:57:55.1649760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.1650353Z 2025-05-07T19:57:55.1654227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1656737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1657820Z ^ 2025-05-07T19:57:55.1658155Z 2025-05-07T19:57:55.1659401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1661102Z int error_code = 0; 2025-05-07T19:57:55.1661428Z ^ 2025-05-07T19:57:55.1661740Z 2025-05-07T19:57:55.1662851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1664480Z int64_t error_value; 2025-05-07T19:57:55.1664866Z ^ 
2025-05-07T19:57:55.1665067Z 2025-05-07T19:57:55.1666293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1667844Z int error_code = 0; 2025-05-07T19:57:55.1668219Z ^ 2025-05-07T19:57:55.1668410Z 2025-05-07T19:57:55.1669621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1671173Z int64_t error_value; 2025-05-07T19:57:55.1671590Z ^ 2025-05-07T19:57:55.1671797Z 2025-05-07T19:57:55.1673069Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1674700Z int error_code = 0; 2025-05-07T19:57:55.1675101Z ^ 2025-05-07T19:57:55.1675303Z 2025-05-07T19:57:55.1676659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1678569Z int64_t error_value; 2025-05-07T19:57:55.1678983Z ^ 2025-05-07T19:57:55.1679217Z 2025-05-07T19:57:55.1680508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1682224Z int error_code = 0; 2025-05-07T19:57:55.1682654Z ^ 2025-05-07T19:57:55.1682864Z 2025-05-07T19:57:55.1684319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 
2025-05-07T19:57:55.1686148Z int64_t error_value; 2025-05-07T19:57:55.1686599Z ^ 2025-05-07T19:57:55.1686809Z 2025-05-07T19:57:55.1688135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1690418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1691487Z ^ 2025-05-07T19:57:55.1691712Z 2025-05-07T19:57:55.1692097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.1692670Z 2025-05-07T19:57:55.1694492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1697255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1698474Z ^ 2025-05-07T19:57:55.1698874Z 2025-05-07T19:57:55.1700308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1702262Z int error_code = 0; 2025-05-07T19:57:55.1702698Z ^ 2025-05-07T19:57:55.1702890Z 2025-05-07T19:57:55.1704262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1706016Z int64_t error_value; 2025-05-07T19:57:55.1706465Z ^ 2025-05-07T19:57:55.1706676Z 2025-05-07T19:57:55.1708069Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1709729Z int error_code = 0; 2025-05-07T19:57:55.1710120Z ^ 2025-05-07T19:57:55.1710324Z 2025-05-07T19:57:55.1711488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1713003Z int64_t error_value; 2025-05-07T19:57:55.1713382Z ^ 2025-05-07T19:57:55.1713620Z 2025-05-07T19:57:55.1714806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1716384Z int error_code = 0; 2025-05-07T19:57:55.1716776Z ^ 2025-05-07T19:57:55.1716993Z 2025-05-07T19:57:55.1718259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1719985Z int64_t error_value; 2025-05-07T19:57:55.1720366Z ^ 2025-05-07T19:57:55.1720585Z 2025-05-07T19:57:55.1721793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1723351Z int error_code = 0; 2025-05-07T19:57:55.1723743Z ^ 2025-05-07T19:57:55.1723941Z 2025-05-07T19:57:55.1725165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1726816Z int64_t error_value; 2025-05-07T19:57:55.1727247Z ^ 
2025-05-07T19:57:55.1727454Z 2025-05-07T19:57:55.1728950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1731386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1732455Z ^ 2025-05-07T19:57:55.1732657Z 2025-05-07T19:57:55.1732986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.1733496Z 2025-05-07T19:57:55.1735127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1737571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1738760Z ^ 2025-05-07T19:57:55.1739100Z 2025-05-07T19:57:55.1740696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1742491Z int error_code = 0; 2025-05-07T19:57:55.1742869Z ^ 2025-05-07T19:57:55.1743046Z 2025-05-07T19:57:55.1744430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1746470Z int64_t error_value; 2025-05-07T19:57:55.1746890Z ^ 2025-05-07T19:57:55.1747115Z 2025-05-07T19:57:55.1748373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable 
"error_code" was declared but never referenced 2025-05-07T19:57:55.1750028Z int error_code = 0; 2025-05-07T19:57:55.1750450Z ^ 2025-05-07T19:57:55.1750645Z 2025-05-07T19:57:55.1751894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1753502Z int64_t error_value; 2025-05-07T19:57:55.1753909Z ^ 2025-05-07T19:57:55.1754150Z 2025-05-07T19:57:55.1755288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1756834Z int error_code = 0; 2025-05-07T19:57:55.1757272Z ^ 2025-05-07T19:57:55.1757450Z 2025-05-07T19:57:55.1758493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1760080Z int64_t error_value; 2025-05-07T19:57:55.1760477Z ^ 2025-05-07T19:57:55.1760680Z 2025-05-07T19:57:55.1761837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1763546Z int error_code = 0; 2025-05-07T19:57:55.1764010Z ^ 2025-05-07T19:57:55.1764201Z 2025-05-07T19:57:55.1765337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1766967Z int64_t error_value; 2025-05-07T19:57:55.1767377Z ^ 2025-05-07T19:57:55.1767602Z 2025-05-07T19:57:55.1769222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1771928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1773065Z ^ 2025-05-07T19:57:55.1773314Z 2025-05-07T19:57:55.1773736Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.1774401Z 2025-05-07T19:57:55.1776756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.1779482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.1780831Z ^ 2025-05-07T19:57:55.1781200Z 2025-05-07T19:57:55.1782599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1784381Z int error_code = 0; 2025-05-07T19:57:55.1784816Z ^ 2025-05-07T19:57:55.1785022Z 2025-05-07T19:57:55.1786414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1788326Z int64_t error_value; 2025-05-07T19:57:55.1788761Z ^ 2025-05-07T19:57:55.1789007Z 2025-05-07T19:57:55.1790461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1792183Z int error_code = 0; 2025-05-07T19:57:55.1792597Z ^ 2025-05-07T19:57:55.1792791Z 
2025-05-07T19:57:55.1794163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1795933Z int64_t error_value; 2025-05-07T19:57:55.1796370Z ^ 2025-05-07T19:57:55.1796593Z 2025-05-07T19:57:55.1797969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1799738Z int error_code = 0; 2025-05-07T19:57:55.1800157Z ^ 2025-05-07T19:57:55.1800376Z 2025-05-07T19:57:55.1801766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1803749Z int64_t error_value; 2025-05-07T19:57:55.1804185Z ^ 2025-05-07T19:57:55.1804406Z 2025-05-07T19:57:55.1805778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:57:55.1807527Z int error_code = 0; 2025-05-07T19:57:55.1807974Z ^ 2025-05-07T19:57:55.1808173Z 2025-05-07T19:57:55.1809577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:57:55.1811302Z int64_t error_value; 2025-05-07T19:57:55.1811733Z ^ 2025-05-07T19:57:55.1811948Z 2025-05-07T19:57:55.4437113Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:57:55.4461293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4464117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4465339Z ^ 2025-05-07T19:57:55.4465612Z 2025-05-07T19:57:55.4466098Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.4466898Z 2025-05-07T19:57:55.4468656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4471491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4472749Z ^ 2025-05-07T19:57:55.4473131Z 2025-05-07T19:57:55.4474515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4477547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4478789Z ^ 
2025-05-07T19:57:55.4479052Z 2025-05-07T19:57:55.4479520Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.4480214Z 2025-05-07T19:57:55.4481974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4484777Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4486022Z ^ 2025-05-07T19:57:55.4486400Z 2025-05-07T19:57:55.4489884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:55.4491746Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:55.4492350Z ^ 2025-05-07T19:57:55.4492620Z 2025-05-07T19:57:55.4494348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4497113Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4498423Z ^ 2025-05-07T19:57:55.4498683Z 2025-05-07T19:57:55.4499144Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.4499859Z 2025-05-07T19:57:55.4501779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4504595Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4505827Z ^ 2025-05-07T19:57:55.4506227Z 2025-05-07T19:57:55.4507636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:55.4509471Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:55.4510036Z ^ 2025-05-07T19:57:55.4510321Z 2025-05-07T19:57:55.4512062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4514840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4516162Z ^ 2025-05-07T19:57:55.4516424Z 2025-05-07T19:57:55.4516900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.4517587Z 2025-05-07T19:57:55.4519329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4522152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4523399Z ^ 2025-05-07T19:57:55.4523791Z 2025-05-07T19:57:55.4525185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:55.4526806Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:55.4527193Z ^ 2025-05-07T19:57:55.4527406Z 2025-05-07T19:57:55.4528817Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4531480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4532733Z ^ 2025-05-07T19:57:55.4533012Z 2025-05-07T19:57:55.4533473Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:55.4534317Z 2025-05-07T19:57:55.4536050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:55.4538845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:55.4540083Z ^ 2025-05-07T19:57:55.4540593Z 2025-05-07T19:57:55.4542001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:57:55.4543853Z const auto offset_idx = idx * D_emb; 2025-05-07T19:57:55.4544426Z ^ 2025-05-07T19:57:55.4544685Z 2025-05-07T19:57:59.0432868Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:59.0455599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0458312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0459529Z ^ 2025-05-07T19:57:59.0459786Z 2025-05-07T19:57:59.0460853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0461547Z 2025-05-07T19:57:59.0463248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0465970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0467105Z ^ 2025-05-07T19:57:59.0467457Z 2025-05-07T19:57:59.0469218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0472006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0473181Z ^ 2025-05-07T19:57:59.0473428Z 2025-05-07T19:57:59.0473813Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0474410Z 2025-05-07T19:57:59.0476304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0478951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0480123Z ^ 2025-05-07T19:57:59.0480519Z 2025-05-07T19:57:59.0482172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0484911Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0486097Z ^ 2025-05-07T19:57:59.0486581Z 2025-05-07T19:57:59.0487043Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0487719Z 2025-05-07T19:57:59.0489430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0492125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0493325Z ^ 2025-05-07T19:57:59.0493706Z 2025-05-07T19:57:59.0495420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0498073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0499117Z ^ 2025-05-07T19:57:59.0499356Z 2025-05-07T19:57:59.0499800Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:57:59.0500611Z 2025-05-07T19:57:59.0502305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0505034Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0506573Z ^ 2025-05-07T19:57:59.0506953Z 2025-05-07T19:57:59.0508679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0511423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0512628Z ^ 2025-05-07T19:57:59.0512880Z 2025-05-07T19:57:59.0513355Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:57:59.0514036Z 2025-05-07T19:57:59.0515756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.0518632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:57:59.0519852Z ^ 2025-05-07T19:57:59.0520246Z 2025-05-07T19:58:00.2426876Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:58:00.2448787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2451423Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2452520Z ^ 2025-05-07T19:58:00.2452766Z 2025-05-07T19:58:00.2453197Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.2453854Z 2025-05-07T19:58:00.2455495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2458264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2459566Z ^ 2025-05-07T19:58:00.2459952Z 2025-05-07T19:58:00.2461903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2464559Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2465710Z ^ 2025-05-07T19:58:00.2465971Z 2025-05-07T19:58:00.2466410Z Remark: The warnings can 
be suppressed with "-diag-suppress " 2025-05-07T19:58:00.2467047Z 2025-05-07T19:58:00.2468686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2471419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2472629Z ^ 2025-05-07T19:58:00.2472997Z 2025-05-07T19:58:00.2474630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2477665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2478841Z ^ 2025-05-07T19:58:00.2479075Z 2025-05-07T19:58:00.2479521Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.2480189Z 2025-05-07T19:58:00.2481859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2484427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2485535Z ^ 2025-05-07T19:58:00.2485869Z 2025-05-07T19:58:00.2487723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2490171Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2491245Z ^ 2025-05-07T19:58:00.2491484Z 2025-05-07T19:58:00.2491907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.2492554Z 2025-05-07T19:58:00.2494457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2496829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2497936Z ^ 2025-05-07T19:58:00.2498297Z 2025-05-07T19:58:00.2499808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2502517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2503820Z ^ 2025-05-07T19:58:00.2504061Z 2025-05-07T19:58:00.2504495Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.2505141Z 2025-05-07T19:58:00.2506811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.2509441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.2510618Z ^ 2025-05-07T19:58:00.2510975Z 2025-05-07T19:58:00.9596458Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:00.9620748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9623427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9624639Z ^ 2025-05-07T19:58:00.9624907Z 2025-05-07T19:58:00.9625357Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9626273Z 2025-05-07T19:58:00.9627962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9630690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9631752Z ^ 2025-05-07T19:58:00.9632153Z 2025-05-07T19:58:00.9633674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9636193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9637228Z ^ 2025-05-07T19:58:00.9637521Z 2025-05-07T19:58:00.9637947Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9638617Z 2025-05-07T19:58:00.9640226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9642807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9644088Z ^ 2025-05-07T19:58:00.9644430Z 2025-05-07T19:58:00.9645955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9648533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9649740Z ^ 2025-05-07T19:58:00.9650007Z 2025-05-07T19:58:00.9650484Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9651198Z 2025-05-07T19:58:00.9652873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9655611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9656825Z ^ 2025-05-07T19:58:00.9657207Z 2025-05-07T19:58:00.9658833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9661491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9662872Z ^ 2025-05-07T19:58:00.9663145Z 2025-05-07T19:58:00.9663617Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9664301Z 2025-05-07T19:58:00.9665990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9668754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9670002Z ^ 2025-05-07T19:58:00.9670373Z 2025-05-07T19:58:00.9672107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9674695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9675643Z ^ 2025-05-07T19:58:00.9675851Z 2025-05-07T19:58:00.9676561Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:00.9677202Z 2025-05-07T19:58:00.9678714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:00.9681186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:00.9682314Z ^ 2025-05-07T19:58:00.9682662Z 2025-05-07T19:58:02.9271816Z 
[274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:58:02.9294910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9297543Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9299013Z ^ 2025-05-07T19:58:02.9299394Z 2025-05-07T19:58:02.9299807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.9300540Z 2025-05-07T19:58:02.9301999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9304394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9305360Z ^ 2025-05-07T19:58:02.9305738Z 2025-05-07T19:58:02.9307113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:02.9309334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9310296Z ^ 2025-05-07T19:58:02.9310538Z 2025-05-07T19:58:02.9310935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.9311544Z 2025-05-07T19:58:02.9313151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9315599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9316791Z ^ 2025-05-07T19:58:02.9317181Z 2025-05-07T19:58:02.9318813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9321421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9322561Z ^ 2025-05-07T19:58:02.9322858Z 2025-05-07T19:58:02.9323312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.9324148Z 2025-05-07T19:58:02.9325858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9328488Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9329575Z ^ 2025-05-07T19:58:02.9329932Z 2025-05-07T19:58:02.9331861Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9334304Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9335389Z ^ 2025-05-07T19:58:02.9335627Z 2025-05-07T19:58:02.9336048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.9336748Z 2025-05-07T19:58:02.9338416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9341006Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9342111Z ^ 2025-05-07T19:58:02.9342532Z 2025-05-07T19:58:02.9344222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9346617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9347595Z ^ 2025-05-07T19:58:02.9347845Z 2025-05-07T19:58:02.9348277Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:02.9348911Z 2025-05-07T19:58:02.9350300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:02.9352650Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:02.9353640Z ^ 2025-05-07T19:58:02.9353960Z 2025-05-07T19:58:03.3026107Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:03.3049911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3052525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3053815Z ^ 2025-05-07T19:58:03.3054069Z 2025-05-07T19:58:03.3054510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:03.3055190Z 2025-05-07T19:58:03.3056840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3059539Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3060882Z ^ 2025-05-07T19:58:03.3061254Z 2025-05-07T19:58:03.3062799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3065672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3066809Z ^ 2025-05-07T19:58:03.3067074Z 2025-05-07T19:58:03.3067486Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:03.3068110Z 2025-05-07T19:58:03.3069751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3072302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3073490Z ^ 2025-05-07T19:58:03.3073809Z 2025-05-07T19:58:03.3075390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3078401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3079583Z ^ 2025-05-07T19:58:03.3079837Z 2025-05-07T19:58:03.3080271Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:03.3080953Z 2025-05-07T19:58:03.3082459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3087361Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3088564Z ^ 2025-05-07T19:58:03.3088889Z 2025-05-07T19:58:03.3090522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3093179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3094313Z ^ 2025-05-07T19:58:03.3094743Z 2025-05-07T19:58:03.3095198Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:03.3095800Z 2025-05-07T19:58:03.3097367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3100013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3101291Z ^ 2025-05-07T19:58:03.3101616Z 2025-05-07T19:58:03.3103225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3105864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3107049Z ^ 2025-05-07T19:58:03.3107305Z 2025-05-07T19:58:03.3107754Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:58:03.3108461Z 2025-05-07T19:58:03.3110066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:03.3112929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:03.3114069Z ^ 2025-05-07T19:58:03.3114420Z 2025-05-07T19:58:04.1205891Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:58:04.1229972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1232710Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1233885Z ^ 2025-05-07T19:58:04.1234174Z 2025-05-07T19:58:04.1234637Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.1235321Z 2025-05-07T19:58:04.1237054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1239770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1241021Z ^ 2025-05-07T19:58:04.1241624Z 2025-05-07T19:58:04.1243150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1245558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1246501Z ^ 2025-05-07T19:58:04.1246729Z 2025-05-07T19:58:04.1247113Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.1247696Z 2025-05-07T19:58:04.1249215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1251698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1252699Z ^ 2025-05-07T19:58:04.1253052Z 2025-05-07T19:58:04.1254458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1256975Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1258051Z ^ 2025-05-07T19:58:04.1258297Z 2025-05-07T19:58:04.1259025Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:04.1259675Z 2025-05-07T19:58:04.1261400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1263936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1265086Z ^ 2025-05-07T19:58:04.1265436Z 2025-05-07T19:58:04.1266976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1269698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1270872Z ^ 2025-05-07T19:58:04.1271118Z 2025-05-07T19:58:04.1271554Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.1272156Z 2025-05-07T19:58:04.1273755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1276546Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1277605Z ^ 2025-05-07T19:58:04.1277928Z 2025-05-07T19:58:04.1279537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1282073Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1283168Z ^ 2025-05-07T19:58:04.1283427Z 2025-05-07T19:58:04.1284119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:04.1284791Z 2025-05-07T19:58:04.1286487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:04.1288983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:04.1290215Z ^ 2025-05-07T19:58:04.1290592Z 2025-05-07T19:58:18.9898813Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:58:18.9921837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9924517Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9925947Z ^ 2025-05-07T19:58:18.9926220Z 2025-05-07T19:58:18.9926697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9927392Z 2025-05-07T19:58:18.9929150Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9932033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9933222Z ^ 2025-05-07T19:58:18.9933589Z 2025-05-07T19:58:18.9935282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9937941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9939105Z ^ 2025-05-07T19:58:18.9939376Z 2025-05-07T19:58:18.9939967Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9940652Z 2025-05-07T19:58:18.9942033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9944674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9945828Z ^ 2025-05-07T19:58:18.9946171Z 2025-05-07T19:58:18.9947664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9950341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9951448Z ^ 
2025-05-07T19:58:18.9951686Z 2025-05-07T19:58:18.9952099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9952715Z 2025-05-07T19:58:18.9954250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9956657Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9957749Z ^ 2025-05-07T19:58:18.9958098Z 2025-05-07T19:58:18.9959647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9962155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9963273Z ^ 2025-05-07T19:58:18.9963512Z 2025-05-07T19:58:18.9963966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9964591Z 2025-05-07T19:58:18.9966163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9968678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9969820Z ^ 2025-05-07T19:58:18.9970173Z 2025-05-07T19:58:18.9971722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:18.9974341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9975445Z ^ 2025-05-07T19:58:18.9975884Z 2025-05-07T19:58:18.9976625Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:18.9977282Z 2025-05-07T19:58:18.9978928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:18.9981680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:18.9982839Z ^ 2025-05-07T19:58:18.9983203Z 2025-05-07T19:58:26.3075529Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:26.3094445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3096593Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3097511Z ^ 
2025-05-07T19:58:26.3097717Z 2025-05-07T19:58:26.3098464Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.3099052Z 2025-05-07T19:58:26.3100660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3103074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3104199Z ^ 2025-05-07T19:58:26.3104543Z 2025-05-07T19:58:26.3105990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3108326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3109419Z ^ 2025-05-07T19:58:26.3109666Z 2025-05-07T19:58:26.3110071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.3110699Z 2025-05-07T19:58:26.3112164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3114548Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3115602Z ^ 2025-05-07T19:58:26.3116297Z 2025-05-07T19:58:26.3117778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:26.3120163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3121207Z ^ 2025-05-07T19:58:26.3121433Z 2025-05-07T19:58:26.3121860Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.3122422Z 2025-05-07T19:58:26.3123982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3126495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3127544Z ^ 2025-05-07T19:58:26.3127874Z 2025-05-07T19:58:26.3129446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3131990Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3133029Z ^ 2025-05-07T19:58:26.3133298Z 2025-05-07T19:58:26.3133709Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.3134301Z 2025-05-07T19:58:26.3135788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3138122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3139474Z ^ 2025-05-07T19:58:26.3139800Z 2025-05-07T19:58:26.3141402Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3143884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3144945Z ^ 2025-05-07T19:58:26.3145178Z 2025-05-07T19:58:26.3145577Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:26.3146181Z 2025-05-07T19:58:26.3147591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.3150236Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:26.3151322Z ^ 2025-05-07T19:58:26.3151685Z 2025-05-07T19:58:27.5207739Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:27.5232723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:58:27.5235733Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5236953Z ^ 2025-05-07T19:58:27.5237232Z 2025-05-07T19:58:27.5237693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.5238417Z 2025-05-07T19:58:27.5240290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5243118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5244366Z ^ 2025-05-07T19:58:27.5244767Z 2025-05-07T19:58:27.5246543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5249387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5250739Z ^ 2025-05-07T19:58:27.5251041Z 2025-05-07T19:58:27.5251507Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.5252179Z 2025-05-07T19:58:27.5254050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5256738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5257976Z ^ 2025-05-07T19:58:27.5258349Z 2025-05-07T19:58:27.5259989Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5262894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5264221Z ^ 2025-05-07T19:58:27.5264510Z 2025-05-07T19:58:27.5264969Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.5265643Z 2025-05-07T19:58:27.5267284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5270173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5271551Z ^ 2025-05-07T19:58:27.5271954Z 2025-05-07T19:58:27.5273616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5276692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5277910Z ^ 2025-05-07T19:58:27.5278219Z 2025-05-07T19:58:27.5278691Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.5279372Z 2025-05-07T19:58:27.5281081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5284062Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5285315Z ^ 2025-05-07T19:58:27.5285701Z 2025-05-07T19:58:27.5287393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5290134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5291356Z ^ 2025-05-07T19:58:27.5291626Z 2025-05-07T19:58:27.5292084Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:27.5292782Z 2025-05-07T19:58:27.5294467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:27.5297229Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:27.5298461Z ^ 2025-05-07T19:58:27.5298863Z 2025-05-07T19:58:31.3119566Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:31.3143245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3145966Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3147182Z ^ 2025-05-07T19:58:31.3147421Z 2025-05-07T19:58:31.3147869Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.3148535Z 2025-05-07T19:58:31.3150240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3152979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3154164Z ^ 2025-05-07T19:58:31.3154573Z 2025-05-07T19:58:31.3156259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3159132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3160266Z ^ 2025-05-07T19:58:31.3160794Z 2025-05-07T19:58:31.3161229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.3161882Z 2025-05-07T19:58:31.3163552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3166225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:58:31.3167421Z ^ 2025-05-07T19:58:31.3167793Z 2025-05-07T19:58:31.3169400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3172163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3173278Z ^ 2025-05-07T19:58:31.3173542Z 2025-05-07T19:58:31.3173983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.3174680Z 2025-05-07T19:58:31.3176617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3179336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3180644Z ^ 2025-05-07T19:58:31.3181035Z 2025-05-07T19:58:31.3182662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3185332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3186709Z ^ 2025-05-07T19:58:31.3186968Z 2025-05-07T19:58:31.3187446Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.3188118Z 2025-05-07T19:58:31.3189751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:58:31.3192490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3193736Z ^ 2025-05-07T19:58:31.3194118Z 2025-05-07T19:58:31.3195743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3198445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3199597Z ^ 2025-05-07T19:58:31.3199849Z 2025-05-07T19:58:31.3200296Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:31.3200919Z 2025-05-07T19:58:31.3202587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:31.3205571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:31.3206781Z ^ 2025-05-07T19:58:31.3207158Z 2025-05-07T19:58:32.2084847Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:32.2108279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2111086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2112400Z ^ 2025-05-07T19:58:32.2112669Z 2025-05-07T19:58:32.2113058Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.2113707Z 2025-05-07T19:58:32.2115389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2118102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2119351Z ^ 2025-05-07T19:58:32.2119723Z 2025-05-07T19:58:32.2121722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2124340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2125535Z ^ 2025-05-07T19:58:32.2125802Z 2025-05-07T19:58:32.2126256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.2126816Z 2025-05-07T19:58:32.2128328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2131039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2132319Z ^ 2025-05-07T19:58:32.2132714Z 2025-05-07T19:58:32.2134328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2136910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2138050Z ^ 2025-05-07T19:58:32.2138340Z 2025-05-07T19:58:32.2138765Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.2139353Z 2025-05-07T19:58:32.2141202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2143898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2145127Z ^ 2025-05-07T19:58:32.2145498Z 2025-05-07T19:58:32.2147180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2150031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2151180Z ^ 2025-05-07T19:58:32.2151425Z 2025-05-07T19:58:32.2151857Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:32.2152513Z 2025-05-07T19:58:32.2154252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2156927Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2158102Z ^ 2025-05-07T19:58:32.2158481Z 2025-05-07T19:58:32.2160084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2162739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2163881Z ^ 2025-05-07T19:58:32.2164151Z 2025-05-07T19:58:32.2164580Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:32.2165239Z 2025-05-07T19:58:32.2167098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:32.2184316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:32.2185713Z ^ 2025-05-07T19:58:32.2186128Z 2025-05-07T19:58:36.2824379Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:36.2845478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2848085Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2849251Z ^ 2025-05-07T19:58:36.2849551Z 2025-05-07T19:58:36.2850012Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.2850695Z 2025-05-07T19:58:36.2852399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2855169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2856400Z ^ 2025-05-07T19:58:36.2856715Z 2025-05-07T19:58:36.2858272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2860833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2861897Z ^ 2025-05-07T19:58:36.2862125Z 2025-05-07T19:58:36.2862578Z 
Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.2863357Z 2025-05-07T19:58:36.2864872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2867342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2868460Z ^ 2025-05-07T19:58:36.2868807Z 2025-05-07T19:58:36.2870204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2872596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2873655Z ^ 2025-05-07T19:58:36.2873917Z 2025-05-07T19:58:36.2874340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.2874952Z 2025-05-07T19:58:36.2876781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2879206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2880502Z ^ 2025-05-07T19:58:36.2880849Z 2025-05-07T19:58:36.2882254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2884642Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2885733Z ^ 2025-05-07T19:58:36.2885966Z 2025-05-07T19:58:36.2886394Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.2886997Z 2025-05-07T19:58:36.2888511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2891005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2892053Z ^ 2025-05-07T19:58:36.2892414Z 2025-05-07T19:58:36.2893857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2896587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2897678Z ^ 2025-05-07T19:58:36.2897951Z 2025-05-07T19:58:36.2898377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.2898994Z 2025-05-07T19:58:36.2900669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.2902996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.2904094Z ^ 2025-05-07T19:58:36.2904587Z 2025-05-07T19:58:36.5461820Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:36.5484227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5486891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5487980Z ^ 2025-05-07T19:58:36.5488216Z 2025-05-07T19:58:36.5488649Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.5489254Z 2025-05-07T19:58:36.5491257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5493800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5494848Z ^ 2025-05-07T19:58:36.5495192Z 2025-05-07T19:58:36.5496621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5499001Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:58:36.5500302Z ^ 2025-05-07T19:58:36.5500725Z 2025-05-07T19:58:36.5501168Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.5501769Z 2025-05-07T19:58:36.5503378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5505868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5507030Z ^ 2025-05-07T19:58:36.5507393Z 2025-05-07T19:58:36.5508981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5511524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5512611Z ^ 2025-05-07T19:58:36.5512857Z 2025-05-07T19:58:36.5513284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.5513947Z 2025-05-07T19:58:36.5515725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5518204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5519347Z ^ 2025-05-07T19:58:36.5519707Z 2025-05-07T19:58:36.5521216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:58:36.5523714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5524786Z ^ 2025-05-07T19:58:36.5525071Z 2025-05-07T19:58:36.5525494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.5526100Z 2025-05-07T19:58:36.5527722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5530086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5531300Z ^ 2025-05-07T19:58:36.5531646Z 2025-05-07T19:58:36.5533462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5535941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5537143Z ^ 2025-05-07T19:58:36.5537401Z 2025-05-07T19:58:36.5537845Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:36.5538489Z 2025-05-07T19:58:36.5539903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:36.5544728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:36.5545830Z ^ 2025-05-07T19:58:36.5546205Z 2025-05-07T19:58:37.0155000Z [284/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:58:37.0178817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0181002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0182401Z ^ 2025-05-07T19:58:37.0182649Z 2025-05-07T19:58:37.0183071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.0183684Z 2025-05-07T19:58:37.0185207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0187770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0189070Z ^ 2025-05-07T19:58:37.0189446Z 2025-05-07T19:58:37.0191328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0194233Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0195454Z ^ 2025-05-07T19:58:37.0195716Z 2025-05-07T19:58:37.0196194Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.0196888Z 2025-05-07T19:58:37.0198837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0201566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0202776Z ^ 2025-05-07T19:58:37.0203145Z 2025-05-07T19:58:37.0204658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0207259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0208583Z ^ 2025-05-07T19:58:37.0208832Z 2025-05-07T19:58:37.0209268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.0209952Z 2025-05-07T19:58:37.0211614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0214267Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0215465Z ^ 2025-05-07T19:58:37.0215847Z 2025-05-07T19:58:37.0217550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0220576Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0221804Z ^ 2025-05-07T19:58:37.0222066Z 2025-05-07T19:58:37.0222548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.0223237Z 2025-05-07T19:58:37.0224979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0227803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0229040Z ^ 2025-05-07T19:58:37.0229410Z 2025-05-07T19:58:37.0231101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0233841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0235007Z ^ 2025-05-07T19:58:37.0235281Z 2025-05-07T19:58:37.0235737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.0236494Z 2025-05-07T19:58:37.0238196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.0240519Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.0241665Z ^ 
2025-05-07T19:58:37.0241998Z 2025-05-07T19:58:37.1703179Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:37.1726775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1729617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1730822Z ^ 2025-05-07T19:58:37.1731105Z 2025-05-07T19:58:37.1731559Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.1732237Z 2025-05-07T19:58:37.1733949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1736756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1737964Z ^ 2025-05-07T19:58:37.1738275Z 2025-05-07T19:58:37.1739913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:58:37.1742630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1743812Z ^ 2025-05-07T19:58:37.1744058Z 2025-05-07T19:58:37.1744510Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.1745186Z 2025-05-07T19:58:37.1746856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1749671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1750862Z ^ 2025-05-07T19:58:37.1751238Z 2025-05-07T19:58:37.1752929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1755663Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1756851Z ^ 2025-05-07T19:58:37.1757108Z 2025-05-07T19:58:37.1757586Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.1758276Z 2025-05-07T19:58:37.1759974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1762781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1763957Z ^ 2025-05-07T19:58:37.1764327Z 2025-05-07T19:58:37.1765937Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1768574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1769717Z ^ 2025-05-07T19:58:37.1769971Z 2025-05-07T19:58:37.1770406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.1771062Z 2025-05-07T19:58:37.1772910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1775596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1777021Z ^ 2025-05-07T19:58:37.1777369Z 2025-05-07T19:58:37.1778957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1781917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1783235Z ^ 2025-05-07T19:58:37.1783513Z 2025-05-07T19:58:37.1783956Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.1784635Z 2025-05-07T19:58:37.1786379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.1789055Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.1790296Z ^ 2025-05-07T19:58:37.1790674Z 2025-05-07T19:58:37.3055928Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:58:37.3068425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3069868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3070495Z ^ 2025-05-07T19:58:37.3070644Z 2025-05-07T19:58:37.3070911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3071341Z 2025-05-07T19:58:37.3072230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3073652Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3074304Z ^ 2025-05-07T19:58:37.3074505Z 2025-05-07T19:58:37.3075419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3077117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3077763Z ^ 2025-05-07T19:58:37.3077907Z 2025-05-07T19:58:37.3078151Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3078523Z 2025-05-07T19:58:37.3079407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3080838Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3081546Z ^ 2025-05-07T19:58:37.3081771Z 2025-05-07T19:58:37.3082643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3084056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3084677Z ^ 2025-05-07T19:58:37.3084821Z 2025-05-07T19:58:37.3085083Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3085437Z 2025-05-07T19:58:37.3086314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3087738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3088380Z ^ 2025-05-07T19:58:37.3088585Z 2025-05-07T19:58:37.3089452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3090980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3091629Z ^ 2025-05-07T19:58:37.3091774Z 2025-05-07T19:58:37.3092016Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3092385Z 2025-05-07T19:58:37.3093269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3094699Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3095337Z ^ 2025-05-07T19:58:37.3095590Z 2025-05-07T19:58:37.3096467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3097865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3098500Z ^ 2025-05-07T19:58:37.3098642Z 2025-05-07T19:58:37.3098896Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:37.3099256Z 2025-05-07T19:58:37.3100132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3101628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3102283Z ^ 2025-05-07T19:58:37.3102484Z 2025-05-07T19:58:37.3288878Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:58:37.3301208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3302633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3303351Z ^ 2025-05-07T19:58:37.3303499Z 2025-05-07T19:58:37.3303768Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3304129Z 2025-05-07T19:58:37.3305015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:37.3306457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3307095Z ^ 2025-05-07T19:58:37.3307311Z 2025-05-07T19:58:37.3308180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3309591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3310220Z ^ 2025-05-07T19:58:37.3310375Z 2025-05-07T19:58:37.3310620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3310978Z 2025-05-07T19:58:37.3311927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3313334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3313979Z ^ 2025-05-07T19:58:37.3314179Z 2025-05-07T19:58:37.3314918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:37.3315862Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:37.3316178Z ^ 2025-05-07T19:58:37.3316324Z 2025-05-07T19:58:37.3317195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3318607Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3319245Z ^ 2025-05-07T19:58:37.3319388Z 2025-05-07T19:58:37.3319630Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3320004Z 2025-05-07T19:58:37.3320889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3322421Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3323063Z ^ 2025-05-07T19:58:37.3323267Z 2025-05-07T19:58:37.3323998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:37.3324932Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:37.3325245Z ^ 2025-05-07T19:58:37.3325389Z 2025-05-07T19:58:37.3326266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3327707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3328348Z ^ 2025-05-07T19:58:37.3328490Z 2025-05-07T19:58:37.3328733Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3329110Z 2025-05-07T19:58:37.3329988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:37.3331415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3332052Z ^ 2025-05-07T19:58:37.3332269Z 2025-05-07T19:58:37.3332986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:37.3333938Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:37.3334241Z ^ 2025-05-07T19:58:37.3334386Z 2025-05-07T19:58:37.3335268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3336707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3337349Z ^ 2025-05-07T19:58:37.3337491Z 2025-05-07T19:58:37.3337750Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:37.3338111Z 2025-05-07T19:58:37.3338998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:37.3340624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:37.3341279Z ^ 2025-05-07T19:58:37.3341484Z 2025-05-07T19:58:37.3342200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:58:37.3343151Z const auto offset_idx = idx * D_emb; 2025-05-07T19:58:37.3343453Z ^ 
2025-05-07T19:58:37.3343612Z 2025-05-07T19:58:42.8109850Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:42.8133248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8136170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8137284Z ^ 2025-05-07T19:58:42.8137574Z 2025-05-07T19:58:42.8138020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.8138675Z 2025-05-07T19:58:42.8140512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8143225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8144394Z ^ 2025-05-07T19:58:42.8144765Z 2025-05-07T19:58:42.8146383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T19:58:42.8149167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8150534Z ^ 2025-05-07T19:58:42.8150794Z 2025-05-07T19:58:42.8151257Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.8152179Z 2025-05-07T19:58:42.8153598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8156143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8157344Z ^ 2025-05-07T19:58:42.8157718Z 2025-05-07T19:58:42.8159431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8162273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8163431Z ^ 2025-05-07T19:58:42.8163710Z 2025-05-07T19:58:42.8164165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.8164845Z 2025-05-07T19:58:42.8166545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8169299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8170526Z ^ 2025-05-07T19:58:42.8170902Z 2025-05-07T19:58:42.8172605Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8175327Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8176788Z ^ 2025-05-07T19:58:42.8177020Z 2025-05-07T19:58:42.8177461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.8178349Z 2025-05-07T19:58:42.8179898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8182726Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8183744Z ^ 2025-05-07T19:58:42.8184099Z 2025-05-07T19:58:42.8185629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8188298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8189496Z ^ 2025-05-07T19:58:42.8189768Z 2025-05-07T19:58:42.8190221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:42.8190902Z 2025-05-07T19:58:42.8192635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.8195397Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:42.8196621Z ^ 2025-05-07T19:58:42.8196993Z 2025-05-07T19:58:43.5802667Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:58:43.5826563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5829329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5830473Z ^ 2025-05-07T19:58:43.5830739Z 2025-05-07T19:58:43.5831163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.5831815Z 2025-05-07T19:58:43.5833453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5836072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5837239Z ^ 2025-05-07T19:58:43.5837606Z 2025-05-07T19:58:43.5839213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5841961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5843099Z ^ 2025-05-07T19:58:43.5843333Z 2025-05-07T19:58:43.5843881Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.5844491Z 2025-05-07T19:58:43.5846087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5848735Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5849997Z ^ 2025-05-07T19:58:43.5850370Z 2025-05-07T19:58:43.5852100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5854694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5855841Z ^ 2025-05-07T19:58:43.5856114Z 2025-05-07T19:58:43.5856573Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.5857249Z 2025-05-07T19:58:43.5858845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5861368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5862512Z ^ 2025-05-07T19:58:43.5862876Z 2025-05-07T19:58:43.5864456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5867142Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5868343Z ^ 2025-05-07T19:58:43.5868594Z 2025-05-07T19:58:43.5869033Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.5869733Z 2025-05-07T19:58:43.5871446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5874205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5875381Z ^ 2025-05-07T19:58:43.5875758Z 2025-05-07T19:58:43.5877664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5880149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5881267Z ^ 2025-05-07T19:58:43.5881522Z 2025-05-07T19:58:43.5881937Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:43.5882594Z 2025-05-07T19:58:43.5884234Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:43.5887287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:43.5888546Z ^ 2025-05-07T19:58:43.5888935Z 2025-05-07T19:58:44.4836789Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:44.4854834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4856857Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4857733Z ^ 2025-05-07T19:58:44.4857955Z 2025-05-07T19:58:44.4858302Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.4858813Z 2025-05-07T19:58:44.4860144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4862545Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4863479Z ^ 2025-05-07T19:58:44.4863743Z 2025-05-07T19:58:44.4865275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4867349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4868274Z ^ 2025-05-07T19:58:44.4868481Z 2025-05-07T19:58:44.4868847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.4869412Z 2025-05-07T19:58:44.4870614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4872589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4873505Z ^ 2025-05-07T19:58:44.4873812Z 2025-05-07T19:58:44.4875095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4877381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4878238Z ^ 2025-05-07T19:58:44.4878477Z 2025-05-07T19:58:44.4878842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.4879386Z 2025-05-07T19:58:44.4880732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4882675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4883602Z ^ 2025-05-07T19:58:44.4884063Z 2025-05-07T19:58:44.4885419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4887493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4888366Z ^ 2025-05-07T19:58:44.4888570Z 2025-05-07T19:58:44.4888949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:44.4889462Z 2025-05-07T19:58:44.4890826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4892890Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4893848Z ^ 2025-05-07T19:58:44.4894156Z 2025-05-07T19:58:44.4895482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4897495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4898398Z ^ 2025-05-07T19:58:44.4898642Z 2025-05-07T19:58:44.4899250Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:58:44.4899793Z 2025-05-07T19:58:44.4901112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4903140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:44.4904084Z ^ 2025-05-07T19:58:44.4904391Z 2025-05-07T19:58:46.7511288Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:46.7534090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7536813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7537887Z ^ 2025-05-07T19:58:46.7538120Z 2025-05-07T19:58:46.7538515Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7539129Z 2025-05-07T19:58:46.7541128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7543554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7544513Z ^ 2025-05-07T19:58:46.7544855Z 2025-05-07T19:58:46.7546194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7548523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7549787Z ^ 2025-05-07T19:58:46.7550012Z 2025-05-07T19:58:46.7550460Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7551121Z 2025-05-07T19:58:46.7552793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7555431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7556618Z ^ 2025-05-07T19:58:46.7556997Z 2025-05-07T19:58:46.7558645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7561115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7562274Z ^ 2025-05-07T19:58:46.7562557Z 2025-05-07T19:58:46.7562966Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7563532Z 2025-05-07T19:58:46.7565073Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7567840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7568875Z ^ 2025-05-07T19:58:46.7569190Z 2025-05-07T19:58:46.7570752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7573313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7574391Z ^ 2025-05-07T19:58:46.7574624Z 2025-05-07T19:58:46.7575026Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7575579Z 2025-05-07T19:58:46.7577393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7579707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7580879Z ^ 2025-05-07T19:58:46.7581195Z 2025-05-07T19:58:46.7582494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7585077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7586186Z ^ 
2025-05-07T19:58:46.7586433Z 2025-05-07T19:58:46.7586833Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7587398Z 2025-05-07T19:58:46.7589079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7591687Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7593023Z ^ 2025-05-07T19:58:46.7593385Z 2025-05-07T19:58:46.7944660Z [292/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:46.7966873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7969331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7970468Z ^ 2025-05-07T19:58:46.7970681Z 2025-05-07T19:58:46.7971464Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7972002Z 2025-05-07T19:58:46.7973317Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7975682Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7977119Z ^ 2025-05-07T19:58:46.7977458Z 2025-05-07T19:58:46.7979063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7982040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7983235Z ^ 2025-05-07T19:58:46.7983487Z 2025-05-07T19:58:46.7983942Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7984622Z 2025-05-07T19:58:46.7986294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7988764Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7989923Z ^ 2025-05-07T19:58:46.7990282Z 2025-05-07T19:58:46.7991775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.7994170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.7995230Z ^ 
2025-05-07T19:58:46.7995704Z 2025-05-07T19:58:46.7996141Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.7996699Z 2025-05-07T19:58:46.7998119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.8000781Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.8001837Z ^ 2025-05-07T19:58:46.8002190Z 2025-05-07T19:58:46.8003593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.8005955Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.8007073Z ^ 2025-05-07T19:58:46.8007311Z 2025-05-07T19:58:46.8007666Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.8008316Z 2025-05-07T19:58:46.8009639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.8011917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.8013271Z ^ 2025-05-07T19:58:46.8013652Z 2025-05-07T19:58:46.8015112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:58:46.8017772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.8018935Z ^ 2025-05-07T19:58:46.8019194Z 2025-05-07T19:58:46.8019634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:46.8020289Z 2025-05-07T19:58:46.8022062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:46.8024807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:46.8025864Z ^ 2025-05-07T19:58:46.8026247Z 2025-05-07T19:58:47.2029222Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:47.2042534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2043943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2044573Z ^ 2025-05-07T19:58:47.2044728Z 
2025-05-07T19:58:47.2044970Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2045332Z 2025-05-07T19:58:47.2046190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2047581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2048296Z ^ 2025-05-07T19:58:47.2048540Z 2025-05-07T19:58:47.2049405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2050813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2051432Z ^ 2025-05-07T19:58:47.2051612Z 2025-05-07T19:58:47.2051863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2052223Z 2025-05-07T19:58:47.2053090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2054526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2055201Z ^ 2025-05-07T19:58:47.2055411Z 2025-05-07T19:58:47.2056264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2057689Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2058307Z ^ 2025-05-07T19:58:47.2058451Z 2025-05-07T19:58:47.2058685Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2059042Z 2025-05-07T19:58:47.2059895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2061417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2062035Z ^ 2025-05-07T19:58:47.2062252Z 2025-05-07T19:58:47.2063105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2064475Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2065117Z ^ 2025-05-07T19:58:47.2065268Z 2025-05-07T19:58:47.2065536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2065896Z 2025-05-07T19:58:47.2066840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2068270Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2068945Z ^ 2025-05-07T19:58:47.2069153Z 2025-05-07T19:58:47.2070013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2071426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2072110Z ^ 2025-05-07T19:58:47.2072262Z 2025-05-07T19:58:47.2072509Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:47.2072865Z 2025-05-07T19:58:47.2073759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.2075148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:47.2075797Z ^ 2025-05-07T19:58:47.2076272Z 2025-05-07T19:58:49.8724431Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:49.8750412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8753222Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8754424Z ^ 2025-05-07T19:58:49.8754721Z 2025-05-07T19:58:49.8755184Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.8755871Z 2025-05-07T19:58:49.8757600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8760030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8761016Z ^ 2025-05-07T19:58:49.8761324Z 2025-05-07T19:58:49.8762645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8765197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8766384Z ^ 2025-05-07T19:58:49.8766640Z 2025-05-07T19:58:49.8767097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.8767805Z 2025-05-07T19:58:49.8769507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8772225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8773554Z ^ 2025-05-07T19:58:49.8773959Z 2025-05-07T19:58:49.8775634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8778601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8779784Z ^ 2025-05-07T19:58:49.8780061Z 2025-05-07T19:58:49.8780596Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.8781279Z 2025-05-07T19:58:49.8782982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8785768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8787044Z ^ 2025-05-07T19:58:49.8787422Z 2025-05-07T19:58:49.8789111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8791736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8793017Z ^ 2025-05-07T19:58:49.8793266Z 2025-05-07T19:58:49.8794155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.8794819Z 2025-05-07T19:58:49.8796408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8799143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:58:49.8800345Z ^ 2025-05-07T19:58:49.8800757Z 2025-05-07T19:58:49.8802371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8805043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8806164Z ^ 2025-05-07T19:58:49.8806406Z 2025-05-07T19:58:49.8806860Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:49.8807500Z 2025-05-07T19:58:49.8809073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:49.8811603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:49.8812846Z ^ 2025-05-07T19:58:49.8813235Z 2025-05-07T19:58:52.8678863Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:52.8702500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8705123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8706478Z ^ 2025-05-07T19:58:52.8706744Z 2025-05-07T19:58:52.8707187Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.8707874Z 2025-05-07T19:58:52.8709502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8712168Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8713316Z ^ 2025-05-07T19:58:52.8713704Z 2025-05-07T19:58:52.8715290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8717902Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8719051Z ^ 2025-05-07T19:58:52.8719304Z 2025-05-07T19:58:52.8719793Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.8720447Z 2025-05-07T19:58:52.8722077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8724997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8726215Z ^ 2025-05-07T19:58:52.8726588Z 2025-05-07T19:58:52.8728433Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8731169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8732390Z ^ 2025-05-07T19:58:52.8732658Z 2025-05-07T19:58:52.8733291Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.8733991Z 2025-05-07T19:58:52.8735624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8738246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8739403Z ^ 2025-05-07T19:58:52.8739772Z 2025-05-07T19:58:52.8741719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8744333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8745521Z ^ 2025-05-07T19:58:52.8745781Z 2025-05-07T19:58:52.8746251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.8746906Z 2025-05-07T19:58:52.8748544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8751165Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8754189Z ^ 2025-05-07T19:58:52.8754579Z 2025-05-07T19:58:52.8756226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8758944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8760138Z ^ 2025-05-07T19:58:52.8760406Z 2025-05-07T19:58:52.8760841Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:52.8761508Z 2025-05-07T19:58:52.8763202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8765884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:52.8767277Z ^ 2025-05-07T19:58:52.8767641Z 2025-05-07T19:58:54.8942237Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:54.8967077Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.8969459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.8970671Z ^ 2025-05-07T19:58:54.8970943Z 2025-05-07T19:58:54.8971408Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.8972094Z 2025-05-07T19:58:54.8973778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.8976760Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.8977935Z ^ 2025-05-07T19:58:54.8978347Z 2025-05-07T19:58:54.8980039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.8982805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.8984211Z ^ 2025-05-07T19:58:54.8984471Z 2025-05-07T19:58:54.8984962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.8985625Z 2025-05-07T19:58:54.8987282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.8990090Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.8991274Z ^ 2025-05-07T19:58:54.8991626Z 2025-05-07T19:58:54.8993206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.8995768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.8996894Z ^ 2025-05-07T19:58:54.8997116Z 2025-05-07T19:58:54.8997514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.8998165Z 2025-05-07T19:58:54.8999857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.9002795Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.9004138Z ^ 2025-05-07T19:58:54.9004505Z 2025-05-07T19:58:54.9006144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.9008668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.9009779Z ^ 2025-05-07T19:58:54.9010026Z 2025-05-07T19:58:54.9010487Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.9011252Z 2025-05-07T19:58:54.9012917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.9015590Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.9016804Z ^ 2025-05-07T19:58:54.9017168Z 2025-05-07T19:58:54.9018801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.9021608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.9022789Z ^ 2025-05-07T19:58:54.9023070Z 2025-05-07T19:58:54.9023517Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:54.9024190Z 2025-05-07T19:58:54.9025872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:54.9028540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:54.9029844Z ^ 2025-05-07T19:58:54.9030205Z 2025-05-07T19:58:56.1724479Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:58:56.1746306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1748978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1750114Z ^ 2025-05-07T19:58:56.1750384Z 2025-05-07T19:58:56.1750729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.1751407Z 2025-05-07T19:58:56.1752873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1755573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1756674Z ^ 2025-05-07T19:58:56.1757027Z 2025-05-07T19:58:56.1758619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1761501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1762555Z ^ 2025-05-07T19:58:56.1762799Z 2025-05-07T19:58:56.1763212Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.1763812Z 2025-05-07T19:58:56.1765345Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1767710Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1768739Z ^ 2025-05-07T19:58:56.1769109Z 2025-05-07T19:58:56.1770606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1773469Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1774628Z ^ 2025-05-07T19:58:56.1774917Z 2025-05-07T19:58:56.1775373Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.1776554Z 2025-05-07T19:58:56.1778502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1781183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1782231Z ^ 2025-05-07T19:58:56.1782571Z 2025-05-07T19:58:56.1784067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1786371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1787650Z ^ 2025-05-07T19:58:56.1787879Z 2025-05-07T19:58:56.1788303Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:58:56.1788947Z 2025-05-07T19:58:56.1790487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1793140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1794320Z ^ 2025-05-07T19:58:56.1794702Z 2025-05-07T19:58:56.1796246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1798638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1799938Z ^ 2025-05-07T19:58:56.1800242Z 2025-05-07T19:58:56.1800697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.1801305Z 2025-05-07T19:58:56.1803070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.1805460Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.1806510Z ^ 2025-05-07T19:58:56.1806868Z 2025-05-07T19:58:56.9780631Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:56.9800615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9802524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9803330Z ^ 2025-05-07T19:58:56.9803555Z 2025-05-07T19:58:56.9803869Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.9804324Z 2025-05-07T19:58:56.9805829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9808196Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9809341Z ^ 2025-05-07T19:58:56.9809727Z 2025-05-07T19:58:56.9811323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9813149Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9814290Z ^ 2025-05-07T19:58:56.9814476Z 2025-05-07T19:58:56.9814844Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:58:56.9815595Z 2025-05-07T19:58:56.9816902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9819166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9820128Z ^ 2025-05-07T19:58:56.9820618Z 2025-05-07T19:58:56.9822298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9824402Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9825705Z ^ 2025-05-07T19:58:56.9825911Z 2025-05-07T19:58:56.9826249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.9826714Z 2025-05-07T19:58:56.9828053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9830387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9831292Z ^ 2025-05-07T19:58:56.9831627Z 2025-05-07T19:58:56.9832964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9834978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9835891Z ^ 2025-05-07T19:58:56.9836086Z 2025-05-07T19:58:56.9836439Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.9836931Z 2025-05-07T19:58:56.9838239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9840310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9841209Z ^ 2025-05-07T19:58:56.9841510Z 2025-05-07T19:58:56.9842753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9844866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9845895Z ^ 2025-05-07T19:58:56.9846087Z 2025-05-07T19:58:56.9846470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:58:56.9846987Z 2025-05-07T19:58:56.9848241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:56.9850158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:58:56.9851054Z ^ 2025-05-07T19:58:56.9851364Z 2025-05-07T19:59:02.4571399Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:02.4595636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4598369Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4599452Z ^ 2025-05-07T19:59:02.4599743Z 2025-05-07T19:59:02.4600203Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.4601156Z 2025-05-07T19:59:02.4602829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4605473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4606650Z ^ 2025-05-07T19:59:02.4607028Z 2025-05-07T19:59:02.4608647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4611345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4612556Z ^ 2025-05-07T19:59:02.4612822Z 2025-05-07T19:59:02.4613272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.4613965Z 2025-05-07T19:59:02.4615603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4618257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4619449Z ^ 2025-05-07T19:59:02.4619847Z 2025-05-07T19:59:02.4621934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4624668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4625773Z ^ 2025-05-07T19:59:02.4626013Z 2025-05-07T19:59:02.4626472Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.4627033Z 2025-05-07T19:59:02.4628649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4631646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4632912Z ^ 2025-05-07T19:59:02.4633294Z 2025-05-07T19:59:02.4634949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4637689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4638913Z ^ 2025-05-07T19:59:02.4639182Z 2025-05-07T19:59:02.4639641Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.4640330Z 2025-05-07T19:59:02.4642038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4644880Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4646160Z ^ 2025-05-07T19:59:02.4646552Z 2025-05-07T19:59:02.4648319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4651251Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4652480Z ^ 2025-05-07T19:59:02.4652696Z 2025-05-07T19:59:02.4653122Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:02.4653809Z 2025-05-07T19:59:02.4655554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:02.4658396Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:02.4659604Z ^ 2025-05-07T19:59:02.4659934Z 2025-05-07T19:59:11.2346089Z 
[300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:11.2368929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2371629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2372993Z ^ 2025-05-07T19:59:11.2373241Z 2025-05-07T19:59:11.2373670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.2374307Z 2025-05-07T19:59:11.2376241Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2378894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2380087Z ^ 2025-05-07T19:59:11.2380536Z 2025-05-07T19:59:11.2382191Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2384858Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2386001Z ^ 2025-05-07T19:59:11.2386251Z 2025-05-07T19:59:11.2386663Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.2387317Z 2025-05-07T19:59:11.2388964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2391958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2393171Z ^ 2025-05-07T19:59:11.2393535Z 2025-05-07T19:59:11.2395179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2397822Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2398958Z ^ 2025-05-07T19:59:11.2399190Z 2025-05-07T19:59:11.2399644Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.2400426Z 2025-05-07T19:59:11.2402103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2404758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2405952Z ^ 2025-05-07T19:59:11.2406317Z 2025-05-07T19:59:11.2407940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2410566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2411725Z ^ 2025-05-07T19:59:11.2411998Z 2025-05-07T19:59:11.2412448Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.2413120Z 2025-05-07T19:59:11.2414825Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2417608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2418734Z ^ 2025-05-07T19:59:11.2419094Z 2025-05-07T19:59:11.2420856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2423418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:11.2424520Z ^ 2025-05-07T19:59:11.2424751Z 2025-05-07T19:59:11.2425196Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:11.2425859Z 2025-05-07T19:59:11.2427450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.2430135Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:59:11.2431310Z ^ 2025-05-07T19:59:11.2431688Z 2025-05-07T19:59:14.6695553Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:14.6717743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6720477Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6721661Z ^ 2025-05-07T19:59:14.6721912Z 2025-05-07T19:59:14.6722372Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:14.6723056Z 2025-05-07T19:59:14.6724761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6727443Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6728641Z ^ 2025-05-07T19:59:14.6729016Z 2025-05-07T19:59:14.6730658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6733171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6734151Z ^ 2025-05-07T19:59:14.6734403Z 2025-05-07T19:59:14.6734784Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:14.6735295Z 2025-05-07T19:59:14.6737034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6739457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6740715Z ^ 2025-05-07T19:59:14.6741049Z 2025-05-07T19:59:14.6742535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6745061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6746128Z ^ 2025-05-07T19:59:14.6746336Z 2025-05-07T19:59:14.6746729Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:14.6747401Z 2025-05-07T19:59:14.6748944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6751441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6752612Z ^ 
2025-05-07T19:59:14.6752994Z 2025-05-07T19:59:14.6754645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6757349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6758519Z ^ 2025-05-07T19:59:14.6758765Z 2025-05-07T19:59:14.6759224Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:14.6760039Z 2025-05-07T19:59:14.6761707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6764394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6765580Z ^ 2025-05-07T19:59:14.6765941Z 2025-05-07T19:59:14.6767609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:14.6770217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6771185Z ^ 2025-05-07T19:59:14.6771383Z 2025-05-07T19:59:14.6771782Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:14.6772380Z 2025-05-07T19:59:14.6773746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:14.6776298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:14.6777390Z ^ 2025-05-07T19:59:14.6777728Z 2025-05-07T19:59:15.2400596Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:15.2422417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2424892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2425958Z ^ 2025-05-07T19:59:15.2426219Z 2025-05-07T19:59:15.2426632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.2427238Z 2025-05-07T19:59:15.2428731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2431140Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2432233Z ^ 2025-05-07T19:59:15.2432566Z 2025-05-07T19:59:15.2434042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2436736Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2437812Z ^ 2025-05-07T19:59:15.2438034Z 2025-05-07T19:59:15.2438424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.2439037Z 2025-05-07T19:59:15.2440499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2443021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2444229Z ^ 2025-05-07T19:59:15.2444592Z 2025-05-07T19:59:15.2446258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2448689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2449765Z ^ 2025-05-07T19:59:15.2450030Z 2025-05-07T19:59:15.2450466Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.2451127Z 2025-05-07T19:59:15.2452800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2455483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2456681Z ^ 2025-05-07T19:59:15.2471228Z 2025-05-07T19:59:15.2472991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2475854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2477278Z ^ 2025-05-07T19:59:15.2477530Z 2025-05-07T19:59:15.2477982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:15.2478670Z 2025-05-07T19:59:15.2480349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2483099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2484289Z ^ 2025-05-07T19:59:15.2484659Z 2025-05-07T19:59:15.2486303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2488949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2490124Z ^ 2025-05-07T19:59:15.2490377Z 2025-05-07T19:59:15.2490836Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:59:15.2491509Z 2025-05-07T19:59:15.2493502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2496163Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:15.2497268Z ^ 2025-05-07T19:59:15.2497595Z 2025-05-07T19:59:16.9746646Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:59:16.9773603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9776930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9778275Z ^ 2025-05-07T19:59:16.9778603Z 2025-05-07T19:59:16.9779099Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:16.9779821Z 2025-05-07T19:59:16.9781749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9784683Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9786345Z ^ 2025-05-07T19:59:16.9786769Z 2025-05-07T19:59:16.9788536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9790782Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9791569Z ^ 2025-05-07T19:59:16.9791906Z 2025-05-07T19:59:16.9793656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9796017Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9796685Z ^ 2025-05-07T19:59:16.9797035Z 2025-05-07T19:59:16.9798814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9801016Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9801697Z ^ 2025-05-07T19:59:16.9802039Z 2025-05-07T19:59:16.9803808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9806678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9808001Z ^ 2025-05-07T19:59:16.9808292Z 2025-05-07T19:59:16.9808784Z 
Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:16.9809538Z 2025-05-07T19:59:16.9811341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9814313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9815823Z ^ 2025-05-07T19:59:16.9816231Z 2025-05-07T19:59:16.9817994Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9820182Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9821016Z ^ 2025-05-07T19:59:16.9821359Z 2025-05-07T19:59:16.9823140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9825321Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9826006Z ^ 2025-05-07T19:59:16.9826348Z 2025-05-07T19:59:16.9828093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9830324Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9831001Z ^ 2025-05-07T19:59:16.9831344Z 2025-05-07T19:59:16.9833132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:16.9836237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9837580Z ^ 2025-05-07T19:59:16.9837871Z 2025-05-07T19:59:16.9838374Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:16.9839100Z 2025-05-07T19:59:16.9840921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9843636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9845109Z ^ 2025-05-07T19:59:16.9845523Z 2025-05-07T19:59:16.9847304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9849488Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9850154Z ^ 2025-05-07T19:59:16.9850495Z 2025-05-07T19:59:16.9852245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9854427Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9855093Z ^ 2025-05-07T19:59:16.9855439Z 2025-05-07T19:59:16.9857165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9859392Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9860054Z ^ 2025-05-07T19:59:16.9860565Z 
2025-05-07T19:59:16.9862335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9865370Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9866624Z ^ 2025-05-07T19:59:16.9866946Z 2025-05-07T19:59:16.9867438Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:16.9868160Z 2025-05-07T19:59:16.9869985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9872848Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9874191Z ^ 2025-05-07T19:59:16.9874609Z 2025-05-07T19:59:16.9876586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9878786Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9879458Z ^ 2025-05-07T19:59:16.9879799Z 2025-05-07T19:59:16.9881530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9883760Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9884688Z ^ 2025-05-07T19:59:16.9885034Z 2025-05-07T19:59:16.9886757Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9888978Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9889638Z ^ 2025-05-07T19:59:16.9889975Z 2025-05-07T19:59:16.9891748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9894808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9896104Z ^ 2025-05-07T19:59:16.9896430Z 2025-05-07T19:59:16.9896936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:16.9897660Z 2025-05-07T19:59:16.9899477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.9902521Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:16.9903864Z ^ 2025-05-07T19:59:16.9904273Z 2025-05-07T19:59:16.9906030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9908155Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9908842Z ^ 2025-05-07T19:59:16.9909187Z 2025-05-07T19:59:16.9910904Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9913285Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9913945Z ^ 2025-05-07T19:59:16.9914279Z 2025-05-07T19:59:16.9916012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:59:16.9918249Z if (output_d >= 0 && output_d < D) { 2025-05-07T19:59:16.9918890Z ^ 2025-05-07T19:59:16.9919257Z 2025-05-07T19:59:17.3688860Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:17.3700999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3702404Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3703038Z ^ 2025-05-07T19:59:17.3703185Z 2025-05-07T19:59:17.3703439Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3703807Z 
2025-05-07T19:59:17.3704665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3706166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3706792Z ^ 2025-05-07T19:59:17.3707009Z 2025-05-07T19:59:17.3707854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3709237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3709848Z ^ 2025-05-07T19:59:17.3709987Z 2025-05-07T19:59:17.3710239Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3710588Z 2025-05-07T19:59:17.3711445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3712835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3713474Z ^ 2025-05-07T19:59:17.3713674Z 2025-05-07T19:59:17.3714519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3715996Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:59:17.3716630Z ^ 2025-05-07T19:59:17.3716775Z 2025-05-07T19:59:17.3717011Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3717378Z 2025-05-07T19:59:17.3718229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3719616Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3720283Z ^ 2025-05-07T19:59:17.3720480Z 2025-05-07T19:59:17.3721341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3722698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3723321Z ^ 2025-05-07T19:59:17.3723457Z 2025-05-07T19:59:17.3723703Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3724049Z 2025-05-07T19:59:17.3724899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3726283Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3726915Z ^ 2025-05-07T19:59:17.3727111Z 2025-05-07T19:59:17.3727952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:17.3729381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3729984Z ^ 2025-05-07T19:59:17.3730135Z 2025-05-07T19:59:17.3730368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:17.3730712Z 2025-05-07T19:59:17.3731574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:17.3732949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:17.3733582Z ^ 2025-05-07T19:59:17.3733778Z 2025-05-07T19:59:18.5877358Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:18.5896749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5899220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5900155Z ^ 2025-05-07T19:59:18.5900837Z 2025-05-07T19:59:18.5901239Z 
Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.5901691Z 2025-05-07T19:59:18.5902758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5904849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5905771Z ^ 2025-05-07T19:59:18.5906005Z 2025-05-07T19:59:18.5907348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5909708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5910805Z ^ 2025-05-07T19:59:18.5911058Z 2025-05-07T19:59:18.5911404Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.5911946Z 2025-05-07T19:59:18.5913312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5915547Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5916375Z ^ 2025-05-07T19:59:18.5916972Z 2025-05-07T19:59:18.5918321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5920761Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5921773Z ^ 2025-05-07T19:59:18.5922027Z 2025-05-07T19:59:18.5922452Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.5923065Z 2025-05-07T19:59:18.5924535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5927131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5928284Z ^ 2025-05-07T19:59:18.5928632Z 2025-05-07T19:59:18.5930035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5932360Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5933331Z ^ 2025-05-07T19:59:18.5933562Z 2025-05-07T19:59:18.5933942Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.5934534Z 2025-05-07T19:59:18.5935923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5938145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5939279Z ^ 2025-05-07T19:59:18.5939624Z 2025-05-07T19:59:18.5941206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5943585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5944636Z ^ 2025-05-07T19:59:18.5944868Z 2025-05-07T19:59:18.5945266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.5945874Z 2025-05-07T19:59:18.5947301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.5949482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.5950517Z ^ 2025-05-07T19:59:18.5950823Z 2025-05-07T19:59:18.7037751Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:59:18.7057878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7060711Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:59:18.7061729Z ^ 2025-05-07T19:59:18.7061952Z 2025-05-07T19:59:18.7062352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7062967Z 2025-05-07T19:59:18.7064449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7066943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7067998Z ^ 2025-05-07T19:59:18.7068333Z 2025-05-07T19:59:18.7069854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7072280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7073371Z ^ 2025-05-07T19:59:18.7073623Z 2025-05-07T19:59:18.7074049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7074617Z 2025-05-07T19:59:18.7076677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7079112Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7080250Z ^ 2025-05-07T19:59:18.7080590Z 2025-05-07T19:59:18.7082053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:18.7084271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7085522Z ^ 2025-05-07T19:59:18.7085767Z 2025-05-07T19:59:18.7086226Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7086926Z 2025-05-07T19:59:18.7088476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7090793Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7091803Z ^ 2025-05-07T19:59:18.7092150Z 2025-05-07T19:59:18.7093534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7095808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7096837Z ^ 2025-05-07T19:59:18.7097065Z 2025-05-07T19:59:18.7097483Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7098064Z 2025-05-07T19:59:18.7099348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7101970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7102983Z ^ 2025-05-07T19:59:18.7103314Z 2025-05-07T19:59:18.7104719Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7107088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7108083Z ^ 2025-05-07T19:59:18.7108309Z 2025-05-07T19:59:18.7108726Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7109319Z 2025-05-07T19:59:18.7110768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7112919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7114045Z ^ 2025-05-07T19:59:18.7114404Z 2025-05-07T19:59:18.7209783Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:18.7231489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7233818Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7234832Z ^ 2025-05-07T19:59:18.7235090Z 2025-05-07T19:59:18.7235474Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7236036Z 2025-05-07T19:59:18.7237546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7240078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7241246Z ^ 2025-05-07T19:59:18.7241571Z 2025-05-07T19:59:18.7242946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7245219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7246285Z ^ 2025-05-07T19:59:18.7246507Z 2025-05-07T19:59:18.7247150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7247762Z 2025-05-07T19:59:18.7249109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7251311Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7252074Z ^ 2025-05-07T19:59:18.7252333Z 2025-05-07T19:59:18.7253688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7256201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7257194Z ^ 2025-05-07T19:59:18.7257428Z 2025-05-07T19:59:18.7257832Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7258420Z 2025-05-07T19:59:18.7259892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7262487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7263541Z ^ 2025-05-07T19:59:18.7263850Z 2025-05-07T19:59:18.7265312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7267169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7268137Z ^ 2025-05-07T19:59:18.7268496Z 2025-05-07T19:59:18.7268885Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7269516Z 2025-05-07T19:59:18.7270982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7273443Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:59:18.7274563Z ^ 2025-05-07T19:59:18.7274886Z 2025-05-07T19:59:18.7276695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7279106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7280189Z ^ 2025-05-07T19:59:18.7280422Z 2025-05-07T19:59:18.7280863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:18.7281475Z 2025-05-07T19:59:18.7282919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:18.7285363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:18.7286678Z ^ 2025-05-07T19:59:18.7287000Z 2025-05-07T19:59:19.0971713Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:19.0993024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.0995500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.0996529Z ^ 2025-05-07T19:59:19.0996761Z 2025-05-07T19:59:19.0997205Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:19.0997803Z 2025-05-07T19:59:19.0999228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1001627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1002661Z ^ 2025-05-07T19:59:19.1003009Z 2025-05-07T19:59:19.1004691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1007031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1008063Z ^ 2025-05-07T19:59:19.1008333Z 2025-05-07T19:59:19.1008711Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:19.1009254Z 2025-05-07T19:59:19.1010611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1013035Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1014345Z ^ 2025-05-07T19:59:19.1014682Z 2025-05-07T19:59:19.1016150Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1018411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1019410Z ^ 2025-05-07T19:59:19.1019645Z 2025-05-07T19:59:19.1020047Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:19.1020764Z 2025-05-07T19:59:19.1022205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1024621Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1025706Z ^ 2025-05-07T19:59:19.1026070Z 2025-05-07T19:59:19.1027537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1030125Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1031062Z ^ 2025-05-07T19:59:19.1031280Z 2025-05-07T19:59:19.1031615Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:19.1032166Z 2025-05-07T19:59:19.1033358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1035512Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1036551Z ^ 2025-05-07T19:59:19.1036910Z 2025-05-07T19:59:19.1038462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1040745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1041790Z ^ 2025-05-07T19:59:19.1042041Z 2025-05-07T19:59:19.1042470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:19.1043121Z 2025-05-07T19:59:19.1044780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1047358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:19.1048537Z ^ 2025-05-07T19:59:19.1048915Z 2025-05-07T19:59:22.2405622Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:22.2431603Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2434534Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2435860Z ^ 2025-05-07T19:59:22.2436152Z 2025-05-07T19:59:22.2436660Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.2437385Z 2025-05-07T19:59:22.2439201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2441802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2443302Z ^ 2025-05-07T19:59:22.2443729Z 2025-05-07T19:59:22.2445347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2448247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2449528Z ^ 2025-05-07T19:59:22.2449821Z 2025-05-07T19:59:22.2450210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.2450933Z 2025-05-07T19:59:22.2452629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2455589Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2456896Z ^ 2025-05-07T19:59:22.2457295Z 2025-05-07T19:59:22.2459081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2462128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2463426Z ^ 2025-05-07T19:59:22.2463707Z 2025-05-07T19:59:22.2464190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.2464936Z 2025-05-07T19:59:22.2466666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2469531Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2470970Z ^ 2025-05-07T19:59:22.2471378Z 2025-05-07T19:59:22.2473136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2475817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2477601Z ^ 2025-05-07T19:59:22.2477883Z 2025-05-07T19:59:22.2478380Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.2479100Z 2025-05-07T19:59:22.2480903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2483811Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2485115Z ^ 2025-05-07T19:59:22.2485518Z 2025-05-07T19:59:22.2487226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2489972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2491236Z ^ 2025-05-07T19:59:22.2491533Z 2025-05-07T19:59:22.2492299Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.2493025Z 2025-05-07T19:59:22.2494676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.2497585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.2498772Z ^ 2025-05-07T19:59:22.2499145Z 2025-05-07T19:59:22.3400922Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:22.3427125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3430044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3431347Z ^ 2025-05-07T19:59:22.3431653Z 2025-05-07T19:59:22.3432163Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.3432893Z 2025-05-07T19:59:22.3435107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3438047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3439371Z ^ 2025-05-07T19:59:22.3439774Z 2025-05-07T19:59:22.3441543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3444417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3445828Z ^ 2025-05-07T19:59:22.3446107Z 2025-05-07T19:59:22.3446591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.3447327Z 2025-05-07T19:59:22.3449015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3451791Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3453070Z ^ 2025-05-07T19:59:22.3453482Z 2025-05-07T19:59:22.3455232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3458117Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3459403Z ^ 2025-05-07T19:59:22.3459695Z 2025-05-07T19:59:22.3460189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.3461082Z 2025-05-07T19:59:22.3462861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3465922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3467212Z ^ 2025-05-07T19:59:22.3467613Z 2025-05-07T19:59:22.3469365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3472271Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3473550Z ^ 2025-05-07T19:59:22.3473832Z 2025-05-07T19:59:22.3474311Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T19:59:22.3475039Z 2025-05-07T19:59:22.3477134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3480014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3481343Z ^ 2025-05-07T19:59:22.3481761Z 2025-05-07T19:59:22.3483847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3486707Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3487985Z ^ 2025-05-07T19:59:22.3488284Z 2025-05-07T19:59:22.3488760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:22.3489479Z 2025-05-07T19:59:22.3491223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.3494111Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:22.3495576Z ^ 2025-05-07T19:59:22.3495973Z 2025-05-07T19:59:24.6713834Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:24.6737076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6739797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6741113Z ^ 2025-05-07T19:59:24.6741682Z 2025-05-07T19:59:24.6742141Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.6742804Z 2025-05-07T19:59:24.6744487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6747175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6748306Z ^ 2025-05-07T19:59:24.6748685Z 2025-05-07T19:59:24.6750398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6753172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6754305Z ^ 2025-05-07T19:59:24.6754567Z 2025-05-07T19:59:24.6754997Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T19:59:24.6755679Z 2025-05-07T19:59:24.6757376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6760007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6761221Z ^ 2025-05-07T19:59:24.6761589Z 2025-05-07T19:59:24.6763251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6765884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6767179Z ^ 2025-05-07T19:59:24.6767420Z 2025-05-07T19:59:24.6767845Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.6768505Z 2025-05-07T19:59:24.6770106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6772786Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6773989Z ^ 2025-05-07T19:59:24.6774376Z 2025-05-07T19:59:24.6776240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6778875Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6780054Z ^ 2025-05-07T19:59:24.6780394Z 2025-05-07T19:59:24.6780827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.6781474Z 2025-05-07T19:59:24.6783059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6786079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6787240Z ^ 2025-05-07T19:59:24.6787598Z 2025-05-07T19:59:24.6789215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6799254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6800476Z ^ 2025-05-07T19:59:24.6800746Z 2025-05-07T19:59:24.6801173Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:24.6801973Z 2025-05-07T19:59:24.6803673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:24.6806334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:24.6807504Z ^ 2025-05-07T19:59:24.6807866Z 2025-05-07T19:59:26.5317662Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:26.5339804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5342648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5343757Z ^ 2025-05-07T19:59:26.5344000Z 2025-05-07T19:59:26.5344422Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5345032Z 2025-05-07T19:59:26.5346906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5349493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5350618Z ^ 2025-05-07T19:59:26.5350969Z 2025-05-07T19:59:26.5352521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5354999Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T19:59:26.5356078Z ^ 2025-05-07T19:59:26.5356305Z 2025-05-07T19:59:26.5356718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5357317Z 2025-05-07T19:59:26.5358880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5361405Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5362505Z ^ 2025-05-07T19:59:26.5362874Z 2025-05-07T19:59:26.5364429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5367131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5368239Z ^ 2025-05-07T19:59:26.5368504Z 2025-05-07T19:59:26.5368931Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5369570Z 2025-05-07T19:59:26.5371180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5373560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5374649Z ^ 2025-05-07T19:59:26.5374963Z 2025-05-07T19:59:26.5376920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T19:59:26.5379289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5380490Z ^ 2025-05-07T19:59:26.5380741Z 2025-05-07T19:59:26.5381137Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5381718Z 2025-05-07T19:59:26.5383330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5385801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5387077Z ^ 2025-05-07T19:59:26.5387446Z 2025-05-07T19:59:26.5389213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5391705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5392959Z ^ 2025-05-07T19:59:26.5393225Z 2025-05-07T19:59:26.5393634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:26.5394404Z 2025-05-07T19:59:26.5395881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:26.5398375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:26.5399508Z ^ 2025-05-07T19:59:26.5399826Z 2025-05-07T19:59:27.8028898Z [313/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:59:27.8049707Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8052132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8053482Z ^ 2025-05-07T19:59:27.8053682Z 2025-05-07T19:59:27.8054117Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:27.8054690Z 2025-05-07T19:59:27.8056356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8058697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8059952Z ^ 2025-05-07T19:59:27.8060460Z 2025-05-07T19:59:27.8061861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8064138Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8065159Z ^ 2025-05-07T19:59:27.8065391Z 2025-05-07T19:59:27.8065801Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:27.8066405Z 2025-05-07T19:59:27.8067814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8070166Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8071409Z ^ 2025-05-07T19:59:27.8071745Z 2025-05-07T19:59:27.8073240Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8075513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8076868Z ^ 2025-05-07T19:59:27.8077159Z 2025-05-07T19:59:27.8077620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:27.8078273Z 2025-05-07T19:59:27.8079666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8082083Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8083159Z ^ 2025-05-07T19:59:27.8083482Z 2025-05-07T19:59:27.8084943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8087334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8088397Z ^ 2025-05-07T19:59:27.8088860Z 2025-05-07T19:59:27.8089268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:27.8089838Z 2025-05-07T19:59:27.8091295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8093741Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8094762Z ^ 2025-05-07T19:59:27.8095109Z 2025-05-07T19:59:27.8096539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8099018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8100026Z ^ 2025-05-07T19:59:27.8100416Z 2025-05-07T19:59:27.8100803Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:27.8101317Z 2025-05-07T19:59:27.8102686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:27.8105078Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:27.8106076Z ^ 
2025-05-07T19:59:27.8106369Z 2025-05-07T19:59:31.3846157Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:59:31.3868558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3871177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3872424Z ^ 2025-05-07T19:59:31.3872682Z 2025-05-07T19:59:31.3873081Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.3873690Z 2025-05-07T19:59:31.3875265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3878033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3879163Z ^ 2025-05-07T19:59:31.3879487Z 2025-05-07T19:59:31.3881044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T19:59:31.3883563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3884665Z ^ 2025-05-07T19:59:31.3884913Z 2025-05-07T19:59:31.3885322Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.3885972Z 2025-05-07T19:59:31.3887647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3890415Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3891502Z ^ 2025-05-07T19:59:31.3891868Z 2025-05-07T19:59:31.3893434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3895957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3897039Z ^ 2025-05-07T19:59:31.3897306Z 2025-05-07T19:59:31.3897726Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.3898371Z 2025-05-07T19:59:31.3900012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3902601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3903805Z ^ 2025-05-07T19:59:31.3904166Z 2025-05-07T19:59:31.3905879Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3908467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3909646Z ^ 2025-05-07T19:59:31.3909911Z 2025-05-07T19:59:31.3910343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.3910988Z 2025-05-07T19:59:31.3912771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3915374Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3916666Z ^ 2025-05-07T19:59:31.3917023Z 2025-05-07T19:59:31.3918656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3921216Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3922327Z ^ 2025-05-07T19:59:31.3922571Z 2025-05-07T19:59:31.3923044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:31.3923678Z 2025-05-07T19:59:31.3925275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.3927836Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:31.3928982Z ^ 2025-05-07T19:59:31.3929382Z 2025-05-07T19:59:40.7782636Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:40.7805117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7807637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7808698Z ^ 2025-05-07T19:59:40.7808957Z 2025-05-07T19:59:40.7809352Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.7810123Z 2025-05-07T19:59:40.7811448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7813968Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7815045Z ^ 2025-05-07T19:59:40.7815409Z 2025-05-07T19:59:40.7816812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7819250Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7820648Z ^ 2025-05-07T19:59:40.7820879Z 2025-05-07T19:59:40.7821340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.7821940Z 2025-05-07T19:59:40.7823435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7825906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7827063Z ^ 2025-05-07T19:59:40.7827417Z 2025-05-07T19:59:40.7828879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7831471Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7832521Z ^ 2025-05-07T19:59:40.7832737Z 2025-05-07T19:59:40.7833119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.7833662Z 2025-05-07T19:59:40.7835105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7837756Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7838858Z ^ 2025-05-07T19:59:40.7839190Z 2025-05-07T19:59:40.7840668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7843160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7844257Z ^ 2025-05-07T19:59:40.7844484Z 2025-05-07T19:59:40.7844900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:40.7845585Z 2025-05-07T19:59:40.7847047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7849387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7850523Z ^ 2025-05-07T19:59:40.7850859Z 2025-05-07T19:59:40.7852367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7854909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7855934Z ^ 2025-05-07T19:59:40.7856192Z 2025-05-07T19:59:40.7856588Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T19:59:40.7857176Z 2025-05-07T19:59:40.7858657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:40.7861407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:40.7862655Z ^ 2025-05-07T19:59:40.7862996Z 2025-05-07T19:59:45.5137379Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:45.5159894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5162435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5163529Z ^ 2025-05-07T19:59:45.5163771Z 2025-05-07T19:59:45.5164189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.5164816Z 2025-05-07T19:59:45.5166454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T19:59:45.5168865Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5169906Z ^ 2025-05-07T19:59:45.5170242Z 2025-05-07T19:59:45.5171811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5174642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5175674Z ^ 2025-05-07T19:59:45.5176170Z 2025-05-07T19:59:45.5176600Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.5177232Z 2025-05-07T19:59:45.5178829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5181467Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5182559Z ^ 2025-05-07T19:59:45.5182930Z 2025-05-07T19:59:45.5184502Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5186829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5187930Z ^ 2025-05-07T19:59:45.5188184Z 2025-05-07T19:59:45.5188620Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.5189261Z 2025-05-07T19:59:45.5191143Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5193617Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5194707Z ^ 2025-05-07T19:59:45.5195056Z 2025-05-07T19:59:45.5196782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5199137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5200432Z ^ 2025-05-07T19:59:45.5200683Z 2025-05-07T19:59:45.5201119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.5201780Z 2025-05-07T19:59:45.5203365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5205680Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5206852Z ^ 2025-05-07T19:59:45.5207228Z 2025-05-07T19:59:45.5208789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5211393Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5212512Z ^ 
2025-05-07T19:59:45.5212767Z 2025-05-07T19:59:45.5213188Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:45.5213976Z 2025-05-07T19:59:45.5215615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.5217965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:45.5219082Z ^ 2025-05-07T19:59:45.5219453Z 2025-05-07T19:59:46.8111012Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:46.8131316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8133646Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8134657Z ^ 2025-05-07T19:59:46.8134970Z 2025-05-07T19:59:46.8135362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.8135963Z 2025-05-07T19:59:46.8137406Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8139919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8141080Z ^ 2025-05-07T19:59:46.8141405Z 2025-05-07T19:59:46.8142862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8145120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8146152Z ^ 2025-05-07T19:59:46.8146382Z 2025-05-07T19:59:46.8146798Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.8147363Z 2025-05-07T19:59:46.8148767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8151100Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8152161Z ^ 2025-05-07T19:59:46.8152485Z 2025-05-07T19:59:46.8153915Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8156536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8157569Z ^ 
2025-05-07T19:59:46.8157784Z 2025-05-07T19:59:46.8158189Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.8158787Z 2025-05-07T19:59:46.8160339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8162644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8163721Z ^ 2025-05-07T19:59:46.8164094Z 2025-05-07T19:59:46.8165420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8167749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8168856Z ^ 2025-05-07T19:59:46.8169087Z 2025-05-07T19:59:46.8169527Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.8170121Z 2025-05-07T19:59:46.8171537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8174244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8175443Z ^ 2025-05-07T19:59:46.8175850Z 2025-05-07T19:59:46.8177800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:46.8180449Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8181446Z ^ 2025-05-07T19:59:46.8181695Z 2025-05-07T19:59:46.8182084Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:46.8182673Z 2025-05-07T19:59:46.8184120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:46.8186392Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:46.8187563Z ^ 2025-05-07T19:59:46.8187936Z 2025-05-07T19:59:48.5384034Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:48.5402170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5404249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5405137Z ^ 2025-05-07T19:59:48.5405375Z 
2025-05-07T19:59:48.5405705Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:48.5406454Z 2025-05-07T19:59:48.5407702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5409755Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5410672Z ^ 2025-05-07T19:59:48.5410963Z 2025-05-07T19:59:48.5412234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5414205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5415116Z ^ 2025-05-07T19:59:48.5415316Z 2025-05-07T19:59:48.5415670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:48.5416204Z 2025-05-07T19:59:48.5417441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5419454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5420675Z ^ 2025-05-07T19:59:48.5421015Z 2025-05-07T19:59:48.5422443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5424529Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5425435Z ^ 2025-05-07T19:59:48.5425656Z 2025-05-07T19:59:48.5426143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:48.5426660Z 2025-05-07T19:59:48.5427941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5430095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5431029Z ^ 2025-05-07T19:59:48.5431335Z 2025-05-07T19:59:48.5432580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5434669Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5435618Z ^ 2025-05-07T19:59:48.5435819Z 2025-05-07T19:59:48.5436193Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:48.5436746Z 2025-05-07T19:59:48.5438012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5440108Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5441051Z ^ 2025-05-07T19:59:48.5441495Z 2025-05-07T19:59:48.5442781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5444861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5445812Z ^ 2025-05-07T19:59:48.5446030Z 2025-05-07T19:59:48.5446413Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:48.5446938Z 2025-05-07T19:59:48.5448227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:48.5450323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:48.5451309Z ^ 2025-05-07T19:59:48.5451597Z 2025-05-07T19:59:52.2027229Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:52.2050762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T19:59:52.2053837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2055182Z ^ 2025-05-07T19:59:52.2055447Z 2025-05-07T19:59:52.2055939Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.2056601Z 2025-05-07T19:59:52.2058337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2061121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2062205Z ^ 2025-05-07T19:59:52.2062567Z 2025-05-07T19:59:52.2064111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2066413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2067427Z ^ 2025-05-07T19:59:52.2067655Z 2025-05-07T19:59:52.2068057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.2068677Z 2025-05-07T19:59:52.2070351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2072888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2073956Z ^ 2025-05-07T19:59:52.2074312Z 2025-05-07T19:59:52.2076222Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2078819Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2079866Z ^ 2025-05-07T19:59:52.2080298Z 2025-05-07T19:59:52.2080700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.2081296Z 2025-05-07T19:59:52.2082823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2085323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2086540Z ^ 2025-05-07T19:59:52.2086910Z 2025-05-07T19:59:52.2088575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2091296Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2092466Z ^ 2025-05-07T19:59:52.2092713Z 2025-05-07T19:59:52.2093167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.2093793Z 2025-05-07T19:59:52.2095339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2098137Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2099295Z ^ 2025-05-07T19:59:52.2099682Z 2025-05-07T19:59:52.2101522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2104042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2105198Z ^ 2025-05-07T19:59:52.2105454Z 2025-05-07T19:59:52.2105892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:52.2106529Z 2025-05-07T19:59:52.2107881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:52.2110573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:52.2111822Z ^ 2025-05-07T19:59:52.2112152Z 2025-05-07T19:59:59.0113083Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:59:59.0134597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0137127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0138218Z ^ 2025-05-07T19:59:59.0138479Z 2025-05-07T19:59:59.0138903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:59.0139515Z 2025-05-07T19:59:59.0141230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0143630Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0144744Z ^ 2025-05-07T19:59:59.0145079Z 2025-05-07T19:59:59.0146627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0149310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0150477Z ^ 2025-05-07T19:59:59.0150744Z 2025-05-07T19:59:59.0151419Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:59.0152094Z 2025-05-07T19:59:59.0153741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0156435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T19:59:59.0157687Z ^ 2025-05-07T19:59:59.0158038Z 2025-05-07T19:59:59.0159619Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0162375Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0163682Z ^ 2025-05-07T19:59:59.0163944Z 2025-05-07T19:59:59.0164386Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:59.0165036Z 2025-05-07T19:59:59.0166634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0169105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0170133Z ^ 2025-05-07T19:59:59.0170426Z 2025-05-07T19:59:59.0171761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0174062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0175108Z ^ 2025-05-07T19:59:59.0175371Z 2025-05-07T19:59:59.0176215Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:59.0176812Z 2025-05-07T19:59:59.0178313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T19:59:59.0180818Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0181903Z ^ 2025-05-07T19:59:59.0182233Z 2025-05-07T19:59:59.0183753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0186187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0187248Z ^ 2025-05-07T19:59:59.0187472Z 2025-05-07T19:59:59.0187895Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T19:59:59.0188515Z 2025-05-07T19:59:59.0190072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:59.0192495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T19:59:59.0193705Z ^ 2025-05-07T19:59:59.0194354Z 2025-05-07T20:00:02.9577528Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T20:00:02.9600563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9603499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9604628Z ^ 2025-05-07T20:00:02.9604898Z 2025-05-07T20:00:02.9605345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:02.9606009Z 2025-05-07T20:00:02.9607529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9610066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9611187Z ^ 2025-05-07T20:00:02.9611535Z 2025-05-07T20:00:02.9613298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9615440Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:02.9616225Z ^ 2025-05-07T20:00:02.9616491Z 2025-05-07T20:00:02.9617907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9620013Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9620733Z ^ 
2025-05-07T20:00:02.9621006Z 2025-05-07T20:00:02.9622589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9624616Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9625194Z ^ 2025-05-07T20:00:02.9625516Z 2025-05-07T20:00:02.9626973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9628977Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9643547Z ^ 2025-05-07T20:00:02.9643895Z 2025-05-07T20:00:02.9645466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9648132Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9649274Z ^ 2025-05-07T20:00:02.9649553Z 2025-05-07T20:00:02.9650002Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:02.9650646Z 2025-05-07T20:00:02.9652372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9655373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9656600Z ^ 2025-05-07T20:00:02.9656961Z 2025-05-07T20:00:02.9658553Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9660745Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:02.9661519Z ^ 2025-05-07T20:00:02.9661820Z 2025-05-07T20:00:02.9663407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9665376Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9665932Z ^ 2025-05-07T20:00:02.9666235Z 2025-05-07T20:00:02.9667830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9669847Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9670405Z ^ 2025-05-07T20:00:02.9670706Z 2025-05-07T20:00:02.9672437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9674432Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9674980Z ^ 2025-05-07T20:00:02.9675271Z 2025-05-07T20:00:02.9677169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9680088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9681306Z ^ 
2025-05-07T20:00:02.9681665Z 2025-05-07T20:00:02.9682140Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:02.9682821Z 2025-05-07T20:00:02.9684556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9687285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9688425Z ^ 2025-05-07T20:00:02.9688736Z 2025-05-07T20:00:02.9690165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9692375Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:02.9693121Z ^ 2025-05-07T20:00:02.9693405Z 2025-05-07T20:00:02.9694877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9696799Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9697299Z ^ 2025-05-07T20:00:02.9697734Z 2025-05-07T20:00:02.9699258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9701261Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9701804Z ^ 2025-05-07T20:00:02.9702241Z 2025-05-07T20:00:02.9703774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): 
warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9705699Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9706205Z ^ 2025-05-07T20:00:02.9706429Z 2025-05-07T20:00:02.9708098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9710573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9711693Z ^ 2025-05-07T20:00:02.9711930Z 2025-05-07T20:00:02.9712356Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:02.9713031Z 2025-05-07T20:00:02.9714766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9717811Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9718989Z ^ 2025-05-07T20:00:02.9719369Z 2025-05-07T20:00:02.9720966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9723062Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:02.9723780Z ^ 2025-05-07T20:00:02.9724065Z 2025-05-07T20:00:02.9725591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9727734Z if (output_d >= 0 
&& output_d < D) { 2025-05-07T20:00:02.9728332Z ^ 2025-05-07T20:00:02.9728624Z 2025-05-07T20:00:02.9730251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9732234Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9732800Z ^ 2025-05-07T20:00:02.9733089Z 2025-05-07T20:00:02.9734569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9736239Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9736790Z ^ 2025-05-07T20:00:02.9737048Z 2025-05-07T20:00:02.9738476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9741101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9742349Z ^ 2025-05-07T20:00:02.9742623Z 2025-05-07T20:00:02.9743049Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:02.9743670Z 2025-05-07T20:00:02.9745203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:02.9747674Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:02.9748686Z ^ 2025-05-07T20:00:02.9748994Z 2025-05-07T20:00:02.9750427Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9752425Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:02.9753161Z ^ 2025-05-07T20:00:02.9753449Z 2025-05-07T20:00:02.9754992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9756932Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9757507Z ^ 2025-05-07T20:00:02.9757802Z 2025-05-07T20:00:02.9759489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9761497Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9762053Z ^ 2025-05-07T20:00:02.9762336Z 2025-05-07T20:00:02.9763752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:02.9765772Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:02.9766270Z ^ 2025-05-07T20:00:02.9766549Z 2025-05-07T20:00:13.9861694Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T20:00:13.9885713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9888343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9889466Z ^ 2025-05-07T20:00:13.9889719Z 2025-05-07T20:00:13.9890165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.9890826Z 2025-05-07T20:00:13.9892683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9895301Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9896503Z ^ 2025-05-07T20:00:13.9896876Z 2025-05-07T20:00:13.9898542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9900648Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:13.9901494Z ^ 2025-05-07T20:00:13.9901774Z 2025-05-07T20:00:13.9903222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison 
of unsigned integer with zero 2025-05-07T20:00:13.9905044Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9905591Z ^ 2025-05-07T20:00:13.9905851Z 2025-05-07T20:00:13.9907323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9909193Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9909701Z ^ 2025-05-07T20:00:13.9909924Z 2025-05-07T20:00:13.9911224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9912999Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9913508Z ^ 2025-05-07T20:00:13.9913831Z 2025-05-07T20:00:13.9915302Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9917891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9918975Z ^ 2025-05-07T20:00:13.9919213Z 2025-05-07T20:00:13.9919618Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.9920240Z 2025-05-07T20:00:13.9921786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9924346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9925475Z ^ 2025-05-07T20:00:13.9925821Z 
2025-05-07T20:00:13.9927211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9929184Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:13.9929884Z ^ 2025-05-07T20:00:13.9930146Z 2025-05-07T20:00:13.9931524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9933373Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9934029Z ^ 2025-05-07T20:00:13.9934306Z 2025-05-07T20:00:13.9935736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9937616Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9938128Z ^ 2025-05-07T20:00:13.9938411Z 2025-05-07T20:00:13.9940004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9942031Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9942501Z ^ 2025-05-07T20:00:13.9942756Z 2025-05-07T20:00:13.9944320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9946705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9947840Z ^ 
2025-05-07T20:00:13.9948106Z 2025-05-07T20:00:13.9948575Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.9949257Z 2025-05-07T20:00:13.9950960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9953560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9954728Z ^ 2025-05-07T20:00:13.9955073Z 2025-05-07T20:00:13.9956569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9958770Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:13.9959484Z ^ 2025-05-07T20:00:13.9959783Z 2025-05-07T20:00:13.9961297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9963237Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9963786Z ^ 2025-05-07T20:00:13.9964083Z 2025-05-07T20:00:13.9965566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9967489Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9968031Z ^ 2025-05-07T20:00:13.9968301Z 2025-05-07T20:00:13.9969741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning 
#186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9971684Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9972223Z ^ 2025-05-07T20:00:13.9972490Z 2025-05-07T20:00:13.9974258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9977222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9978253Z ^ 2025-05-07T20:00:13.9978500Z 2025-05-07T20:00:13.9978923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:13.9979700Z 2025-05-07T20:00:13.9981518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:13.9984249Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:13.9985570Z ^ 2025-05-07T20:00:13.9985934Z 2025-05-07T20:00:13.9987449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9989368Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:13.9990125Z ^ 2025-05-07T20:00:13.9990411Z 2025-05-07T20:00:13.9991943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9993947Z if (output_d >= 0 && output_d 
< D) { 2025-05-07T20:00:13.9994503Z ^ 2025-05-07T20:00:13.9994812Z 2025-05-07T20:00:13.9996283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:13.9998075Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:13.9998570Z ^ 2025-05-07T20:00:13.9998832Z 2025-05-07T20:00:14.0000309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:14.0002356Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:14.0002885Z ^ 2025-05-07T20:00:14.0003150Z 2025-05-07T20:00:14.0004754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.0007349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:14.0008552Z ^ 2025-05-07T20:00:14.0008807Z 2025-05-07T20:00:14.0009268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:14.0009953Z 2025-05-07T20:00:14.0011537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.0014243Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:14.0015328Z ^ 2025-05-07T20:00:14.0015685Z 2025-05-07T20:00:14.0017088Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:14.0018972Z if (output_d >= 0 && output_d < D && packed_bag_store_idx < num_packed_bags) { 2025-05-07T20:00:14.0019754Z ^ 2025-05-07T20:00:14.0020027Z 2025-05-07T20:00:14.0021556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:14.0023125Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:14.0023641Z ^ 2025-05-07T20:00:14.0023907Z 2025-05-07T20:00:14.0025246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:14.0026899Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:14.0027450Z ^ 2025-05-07T20:00:14.0027704Z 2025-05-07T20:00:14.0029084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T20:00:14.0031086Z if (output_d >= 0 && output_d < D) { 2025-05-07T20:00:14.0031653Z ^ 2025-05-07T20:00:14.0031918Z 2025-05-07T20:00:15.1522312Z [323/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared 
-Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed 
asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:00:24.6521944Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T20:00:24.6545686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6548575Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6549799Z ^ 2025-05-07T20:00:24.6550086Z 2025-05-07T20:00:24.6550565Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.6551258Z 2025-05-07T20:00:24.6553003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6555878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6557061Z ^ 2025-05-07T20:00:24.6557440Z 2025-05-07T20:00:24.6559101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6561817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6563344Z ^ 2025-05-07T20:00:24.6563619Z 2025-05-07T20:00:24.6564064Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.6564717Z 2025-05-07T20:00:24.6566430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6569144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6570349Z ^ 2025-05-07T20:00:24.6570709Z 2025-05-07T20:00:24.6572378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6575058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6576584Z ^ 2025-05-07T20:00:24.6576854Z 2025-05-07T20:00:24.6577435Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:00:24.6578141Z 2025-05-07T20:00:24.6579832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6583061Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6584286Z ^ 2025-05-07T20:00:24.6584679Z 2025-05-07T20:00:24.6586396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6589394Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6590570Z ^ 2025-05-07T20:00:24.6590844Z 2025-05-07T20:00:24.6591298Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.6592065Z 2025-05-07T20:00:24.6593761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6596470Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6597677Z ^ 2025-05-07T20:00:24.6598041Z 2025-05-07T20:00:24.6599712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6602376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6603588Z ^ 2025-05-07T20:00:24.6603847Z 2025-05-07T20:00:24.6604295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:24.6605130Z 2025-05-07T20:00:24.6606867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.6609886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:24.6611109Z ^ 2025-05-07T20:00:24.6611509Z 2025-05-07T20:00:25.9853776Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T20:00:25.9876887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9879635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9880821Z ^ 2025-05-07T20:00:25.9881083Z 2025-05-07T20:00:25.9881540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:25.9882241Z 2025-05-07T20:00:25.9883905Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9886513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9887649Z ^ 2025-05-07T20:00:25.9888011Z 2025-05-07T20:00:25.9889596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9892523Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9893597Z ^ 2025-05-07T20:00:25.9893820Z 2025-05-07T20:00:25.9894266Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:25.9894922Z 2025-05-07T20:00:25.9896517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9899165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9900440Z ^ 2025-05-07T20:00:25.9900808Z 2025-05-07T20:00:25.9902361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9905026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9906188Z ^ 
2025-05-07T20:00:25.9906449Z 2025-05-07T20:00:25.9906878Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:25.9907541Z 2025-05-07T20:00:25.9909461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9912188Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9913401Z ^ 2025-05-07T20:00:25.9913753Z 2025-05-07T20:00:25.9915535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9918053Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9919246Z ^ 2025-05-07T20:00:25.9919480Z 2025-05-07T20:00:25.9920064Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:25.9920775Z 2025-05-07T20:00:25.9922440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9925062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9926112Z ^ 2025-05-07T20:00:25.9926470Z 2025-05-07T20:00:25.9927982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:25.9930811Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9932031Z ^ 2025-05-07T20:00:25.9932304Z 2025-05-07T20:00:25.9932769Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:25.9933472Z 2025-05-07T20:00:25.9935412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:25.9938151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:25.9939336Z ^ 2025-05-07T20:00:25.9939650Z 2025-05-07T20:00:38.4120344Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T20:00:38.4143728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4146353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4147548Z ^ 
2025-05-07T20:00:38.4147806Z 2025-05-07T20:00:38.4148252Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:38.4148875Z 2025-05-07T20:00:38.4150347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4153242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4154292Z ^ 2025-05-07T20:00:38.4154699Z 2025-05-07T20:00:38.4156290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4158961Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4160041Z ^ 2025-05-07T20:00:38.4160294Z 2025-05-07T20:00:38.4160717Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:38.4161366Z 2025-05-07T20:00:38.4162996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4165489Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4166671Z ^ 2025-05-07T20:00:38.4167032Z 2025-05-07T20:00:38.4168597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:00:38.4171320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4172418Z ^ 2025-05-07T20:00:38.4172646Z 2025-05-07T20:00:38.4173054Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:38.4173721Z 2025-05-07T20:00:38.4175319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4178189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4179285Z ^ 2025-05-07T20:00:38.4179754Z 2025-05-07T20:00:38.4181374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4184010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4185063Z ^ 2025-05-07T20:00:38.4185309Z 2025-05-07T20:00:38.4185738Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:38.4186360Z 2025-05-07T20:00:38.4187978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4190567Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4191704Z ^ 2025-05-07T20:00:38.4192037Z 2025-05-07T20:00:38.4193657Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4196241Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4197577Z ^ 2025-05-07T20:00:38.4197832Z 2025-05-07T20:00:38.4198279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:38.4199086Z 2025-05-07T20:00:38.4200819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.4203278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:38.4204375Z ^ 2025-05-07T20:00:38.4204655Z 2025-05-07T20:00:42.7686379Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:42.7710438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:00:42.7713127Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7714225Z ^ 2025-05-07T20:00:42.7714467Z 2025-05-07T20:00:42.7714906Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:42.7715776Z 2025-05-07T20:00:42.7717318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7720141Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7721330Z ^ 2025-05-07T20:00:42.7721684Z 2025-05-07T20:00:42.7723203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7725791Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7726901Z ^ 2025-05-07T20:00:42.7727161Z 2025-05-07T20:00:42.7727608Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:42.7728272Z 2025-05-07T20:00:42.7729881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7732505Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7733703Z ^ 2025-05-07T20:00:42.7734076Z 2025-05-07T20:00:42.7735914Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7738639Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7739836Z ^ 2025-05-07T20:00:42.7740097Z 2025-05-07T20:00:42.7740786Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:42.7741450Z 2025-05-07T20:00:42.7743144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7745903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7747108Z ^ 2025-05-07T20:00:42.7747480Z 2025-05-07T20:00:42.7749063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7751790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7752948Z ^ 2025-05-07T20:00:42.7753217Z 2025-05-07T20:00:42.7753670Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:42.7754352Z 2025-05-07T20:00:42.7756106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7758585Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7759776Z ^ 2025-05-07T20:00:42.7760269Z 2025-05-07T20:00:42.7761911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7764023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7764937Z ^ 2025-05-07T20:00:42.7765148Z 2025-05-07T20:00:42.7765528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:42.7766096Z 2025-05-07T20:00:42.7767580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.7770082Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:42.7771214Z ^ 2025-05-07T20:00:42.7771577Z 2025-05-07T20:00:47.9044551Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T20:00:47.9068094Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9071223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9072453Z ^ 2025-05-07T20:00:47.9072696Z 2025-05-07T20:00:47.9073139Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9073754Z 2025-05-07T20:00:47.9075367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9078321Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9079457Z ^ 2025-05-07T20:00:47.9079814Z 2025-05-07T20:00:47.9081392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9083826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9084901Z ^ 2025-05-07T20:00:47.9085141Z 2025-05-07T20:00:47.9085550Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9086167Z 2025-05-07T20:00:47.9088048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9090670Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9091871Z ^ 2025-05-07T20:00:47.9092239Z 2025-05-07T20:00:47.9096386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9098920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9100123Z ^ 2025-05-07T20:00:47.9100496Z 2025-05-07T20:00:47.9100935Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9101532Z 2025-05-07T20:00:47.9103017Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9105648Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9106849Z ^ 2025-05-07T20:00:47.9107227Z 2025-05-07T20:00:47.9108910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9111549Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9112716Z ^ 2025-05-07T20:00:47.9112984Z 2025-05-07T20:00:47.9113434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9114052Z 2025-05-07T20:00:47.9115682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9118536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9119829Z ^ 2025-05-07T20:00:47.9120222Z 2025-05-07T20:00:47.9121949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9124750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9125993Z ^ 2025-05-07T20:00:47.9126373Z 2025-05-07T20:00:47.9126831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:47.9127535Z 2025-05-07T20:00:47.9129236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:47.9131886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:47.9133075Z ^ 2025-05-07T20:00:47.9133445Z 2025-05-07T20:00:48.5362169Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:00:48.5382875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5384850Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5385746Z ^ 2025-05-07T20:00:48.5385948Z 2025-05-07T20:00:48.5386292Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.5386822Z 2025-05-07T20:00:48.5388199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5390336Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5391230Z ^ 2025-05-07T20:00:48.5391525Z 2025-05-07T20:00:48.5392806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5394905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5395786Z ^ 2025-05-07T20:00:48.5395995Z 2025-05-07T20:00:48.5396704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.5397187Z 2025-05-07T20:00:48.5398546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5401060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5402150Z ^ 2025-05-07T20:00:48.5402508Z 2025-05-07T20:00:48.5404021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5406388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5407350Z ^ 2025-05-07T20:00:48.5407587Z 2025-05-07T20:00:48.5407957Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.5408521Z 2025-05-07T20:00:48.5409932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5412292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5413384Z ^ 2025-05-07T20:00:48.5413717Z 2025-05-07T20:00:48.5415273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5417514Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5418881Z ^ 2025-05-07T20:00:48.5419092Z 2025-05-07T20:00:48.5419465Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:00:48.5419996Z 2025-05-07T20:00:48.5421515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5423672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5424618Z ^ 2025-05-07T20:00:48.5424933Z 2025-05-07T20:00:48.5426255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5428391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5429332Z ^ 2025-05-07T20:00:48.5429556Z 2025-05-07T20:00:48.5429917Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.5430447Z 2025-05-07T20:00:48.5432002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.5434230Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.5435315Z ^ 2025-05-07T20:00:48.5435610Z 2025-05-07T20:00:48.8794406Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T20:00:48.8816671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8819413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8820693Z ^ 2025-05-07T20:00:48.8820940Z 2025-05-07T20:00:48.8821370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.8821992Z 2025-05-07T20:00:48.8823487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8826011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8827101Z ^ 2025-05-07T20:00:48.8827431Z 2025-05-07T20:00:48.8829111Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8831561Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8832674Z ^ 2025-05-07T20:00:48.8832934Z 2025-05-07T20:00:48.8833363Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:00:48.8833975Z 2025-05-07T20:00:48.8835662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8838079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8839289Z ^ 2025-05-07T20:00:48.8839622Z 2025-05-07T20:00:48.8841210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8844008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8845308Z ^ 2025-05-07T20:00:48.8845585Z 2025-05-07T20:00:48.8846022Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.8846517Z 2025-05-07T20:00:48.8848198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8850801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8852003Z ^ 2025-05-07T20:00:48.8852352Z 2025-05-07T20:00:48.8853967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8856868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8858046Z ^ 2025-05-07T20:00:48.8858467Z 2025-05-07T20:00:48.8858912Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.8859590Z 2025-05-07T20:00:48.8861379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8864088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8865238Z ^ 2025-05-07T20:00:48.8865634Z 2025-05-07T20:00:48.8867285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8869971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8871116Z ^ 2025-05-07T20:00:48.8871387Z 2025-05-07T20:00:48.8871832Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:48.8872476Z 2025-05-07T20:00:48.8874453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:48.8877621Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:48.8878849Z ^ 2025-05-07T20:00:48.8879226Z 2025-05-07T20:00:51.0544263Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T20:00:51.0566938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0569539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0570771Z ^ 2025-05-07T20:00:51.0571026Z 2025-05-07T20:00:51.0571514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.0572217Z 2025-05-07T20:00:51.0573968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0577077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0578537Z ^ 2025-05-07T20:00:51.0578849Z 2025-05-07T20:00:51.0580462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0583023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:00:51.0584292Z ^ 2025-05-07T20:00:51.0584558Z 2025-05-07T20:00:51.0584983Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.0585627Z 2025-05-07T20:00:51.0587265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0589858Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0591027Z ^ 2025-05-07T20:00:51.0591565Z 2025-05-07T20:00:51.0593155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0595672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0596727Z ^ 2025-05-07T20:00:51.0596972Z 2025-05-07T20:00:51.0597360Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.0597968Z 2025-05-07T20:00:51.0599574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0602115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0603445Z ^ 2025-05-07T20:00:51.0603824Z 2025-05-07T20:00:51.0605327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:00:51.0607839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0609047Z ^ 2025-05-07T20:00:51.0609505Z 2025-05-07T20:00:51.0609982Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.0610680Z 2025-05-07T20:00:51.0612474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0615255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0616296Z ^ 2025-05-07T20:00:51.0616653Z 2025-05-07T20:00:51.0618145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0620895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0622068Z ^ 2025-05-07T20:00:51.0622474Z 2025-05-07T20:00:51.0622937Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:51.0623593Z 2025-05-07T20:00:51.0625123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:51.0627916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:51.0629075Z ^ 2025-05-07T20:00:51.0629478Z 2025-05-07T20:00:59.4702021Z [332/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:00:59.4726109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4728840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4729989Z ^ 2025-05-07T20:00:59.4730272Z 2025-05-07T20:00:59.4730706Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:59.4731340Z 2025-05-07T20:00:59.4733378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4735584Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4736566Z ^ 2025-05-07T20:00:59.4736880Z 2025-05-07T20:00:59.4738229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4740522Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4741539Z ^ 2025-05-07T20:00:59.4741781Z 2025-05-07T20:00:59.4742174Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:59.4742747Z 2025-05-07T20:00:59.4744323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4746916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4748191Z ^ 2025-05-07T20:00:59.4748597Z 2025-05-07T20:00:59.4750346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4752845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4754016Z ^ 2025-05-07T20:00:59.4754280Z 2025-05-07T20:00:59.4754754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:59.4755424Z 2025-05-07T20:00:59.4757143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4759950Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4761179Z ^ 2025-05-07T20:00:59.4761544Z 2025-05-07T20:00:59.4763112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4765578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4766677Z ^ 2025-05-07T20:00:59.4766917Z 2025-05-07T20:00:59.4767347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:59.4767936Z 2025-05-07T20:00:59.4769361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4771845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4772965Z ^ 2025-05-07T20:00:59.4773308Z 2025-05-07T20:00:59.4774965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4777881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:00:59.4779130Z ^ 2025-05-07T20:00:59.4779342Z 2025-05-07T20:00:59.4779780Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:00:59.4780522Z 2025-05-07T20:00:59.4782477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:59.4785438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:00:59.4786695Z ^ 2025-05-07T20:00:59.4787080Z 2025-05-07T20:01:00.2190889Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T20:01:00.2214348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2217074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2218376Z ^ 2025-05-07T20:01:00.2218665Z 2025-05-07T20:01:00.2219529Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.2220354Z 2025-05-07T20:01:00.2222108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2224810Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2226028Z ^ 2025-05-07T20:01:00.2226381Z 2025-05-07T20:01:00.2227957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:01:00.2230820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2231983Z ^ 2025-05-07T20:01:00.2232252Z 2025-05-07T20:01:00.2232708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.2233367Z 2025-05-07T20:01:00.2234979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2237688Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2238783Z ^ 2025-05-07T20:01:00.2239088Z 2025-05-07T20:01:00.2240393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2242910Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2244105Z ^ 2025-05-07T20:01:00.2244402Z 2025-05-07T20:01:00.2244853Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.2245497Z 2025-05-07T20:01:00.2247179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2250018Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2251166Z ^ 2025-05-07T20:01:00.2251524Z 2025-05-07T20:01:00.2253115Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2255765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2256915Z ^ 2025-05-07T20:01:00.2257168Z 2025-05-07T20:01:00.2257627Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.2258446Z 2025-05-07T20:01:00.2260337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2263244Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2264604Z ^ 2025-05-07T20:01:00.2265001Z 2025-05-07T20:01:00.2266636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2269407Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2270730Z ^ 2025-05-07T20:01:00.2271000Z 2025-05-07T20:01:00.2271489Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.2272167Z 2025-05-07T20:01:00.2273941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2276984Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.2278245Z ^ 2025-05-07T20:01:00.2278634Z 2025-05-07T20:01:00.4552820Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T20:01:00.4577117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4579871Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4581113Z ^ 2025-05-07T20:01:00.4581392Z 2025-05-07T20:01:00.4581847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4582517Z 2025-05-07T20:01:00.4584334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4587040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4588316Z ^ 2025-05-07T20:01:00.4588684Z 2025-05-07T20:01:00.4590354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4593014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4594181Z ^ 2025-05-07T20:01:00.4594432Z 2025-05-07T20:01:00.4594892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4595572Z 2025-05-07T20:01:00.4597412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4600400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4601640Z ^ 2025-05-07T20:01:00.4602043Z 2025-05-07T20:01:00.4603781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4606813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4607994Z ^ 2025-05-07T20:01:00.4608270Z 2025-05-07T20:01:00.4608716Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4609390Z 2025-05-07T20:01:00.4610853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4613518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4614594Z ^ 2025-05-07T20:01:00.4614953Z 2025-05-07T20:01:00.4616402Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4618349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4619299Z ^ 2025-05-07T20:01:00.4619496Z 2025-05-07T20:01:00.4619811Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4620466Z 2025-05-07T20:01:00.4621970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4624026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4625169Z ^ 2025-05-07T20:01:00.4625546Z 2025-05-07T20:01:00.4627238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4629813Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4630940Z ^ 2025-05-07T20:01:00.4631183Z 2025-05-07T20:01:00.4631631Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:00.4632288Z 2025-05-07T20:01:00.4633932Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.4636465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:00.4637617Z ^ 2025-05-07T20:01:00.4637979Z 2025-05-07T20:01:08.9778554Z [335/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T20:01:08.9798709Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:09.9237462Z [336/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T20:01:09.9255637Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:11.6579179Z [337/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:11.6597506Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:14.5814467Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:14.5843133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5846021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5847355Z ^ 2025-05-07T20:01:14.5847646Z 2025-05-07T20:01:14.5848148Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.5848904Z 2025-05-07T20:01:14.5851008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5853846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5855161Z ^ 2025-05-07T20:01:14.5855571Z 2025-05-07T20:01:14.5857604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5860698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:14.5862079Z ^ 2025-05-07T20:01:14.5862402Z 2025-05-07T20:01:14.5862919Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.5863619Z 2025-05-07T20:01:14.5865415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5868323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5869707Z ^ 2025-05-07T20:01:14.5870133Z 2025-05-07T20:01:14.5871973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5874935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5876619Z ^ 2025-05-07T20:01:14.5876913Z 2025-05-07T20:01:14.5877431Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.5878216Z 2025-05-07T20:01:14.5879662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5882768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5884078Z ^ 2025-05-07T20:01:14.5884531Z 2025-05-07T20:01:14.5886180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:01:14.5889123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5890369Z ^ 2025-05-07T20:01:14.5890704Z 2025-05-07T20:01:14.5891135Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.5891841Z 2025-05-07T20:01:14.5893602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5896413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5897707Z ^ 2025-05-07T20:01:14.5898137Z 2025-05-07T20:01:14.5900227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5903093Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5904381Z ^ 2025-05-07T20:01:14.5904643Z 2025-05-07T20:01:14.5905151Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:14.5905926Z 2025-05-07T20:01:14.5907940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5910945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:14.5912416Z ^ 2025-05-07T20:01:14.5912871Z 2025-05-07T20:01:18.3593104Z [339/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T20:01:18.3612994Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:19.3078077Z [340/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp 
-stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T20:01:19.3099276Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:19.9812943Z [341/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:19.9837284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:19.9839920Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9841121Z ^ 2025-05-07T20:01:19.9841366Z 2025-05-07T20:01:19.9841815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.9842448Z 2025-05-07T20:01:19.9844088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9846825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9848115Z ^ 2025-05-07T20:01:19.9848488Z 2025-05-07T20:01:19.9850238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9852953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9854135Z ^ 2025-05-07T20:01:19.9854390Z 2025-05-07T20:01:19.9854863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.9855529Z 2025-05-07T20:01:19.9857221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9860332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9861639Z ^ 2025-05-07T20:01:19.9862039Z 2025-05-07T20:01:19.9863794Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9866524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9867721Z ^ 2025-05-07T20:01:19.9867984Z 2025-05-07T20:01:19.9868467Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.9869103Z 2025-05-07T20:01:19.9870617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9873300Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9874451Z ^ 2025-05-07T20:01:19.9874858Z 2025-05-07T20:01:19.9876978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9879456Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9880646Z ^ 2025-05-07T20:01:19.9880947Z 2025-05-07T20:01:19.9881394Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.9882061Z 2025-05-07T20:01:19.9883930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9886655Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9888016Z ^ 2025-05-07T20:01:19.9888398Z 2025-05-07T20:01:19.9890124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9892608Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9893825Z ^ 2025-05-07T20:01:19.9894068Z 2025-05-07T20:01:19.9894503Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:19.9895196Z 2025-05-07T20:01:19.9896760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.9899264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:19.9900607Z ^ 2025-05-07T20:01:19.9901001Z 2025-05-07T20:01:21.3578560Z [342/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 
2025-05-07T20:01:21.3601633Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3604448Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3605698Z ^ 2025-05-07T20:01:21.3605993Z 2025-05-07T20:01:21.3606447Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.3607148Z 2025-05-07T20:01:21.3608919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3611768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3613025Z ^ 2025-05-07T20:01:21.3613424Z 2025-05-07T20:01:21.3615258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3618030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3619430Z ^ 2025-05-07T20:01:21.3619697Z 2025-05-07T20:01:21.3620280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.3620974Z 2025-05-07T20:01:21.3622485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3625104Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3626329Z ^ 2025-05-07T20:01:21.3626709Z 2025-05-07T20:01:21.3628286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3630825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3632023Z ^ 2025-05-07T20:01:21.3632287Z 2025-05-07T20:01:21.3632737Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.3633422Z 2025-05-07T20:01:21.3634965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3637750Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3638957Z ^ 2025-05-07T20:01:21.3639313Z 2025-05-07T20:01:21.3641006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3646693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3648011Z ^ 2025-05-07T20:01:21.3648247Z 2025-05-07T20:01:21.3648653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.3649423Z 2025-05-07T20:01:21.3651101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3653826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3655024Z ^ 2025-05-07T20:01:21.3655386Z 2025-05-07T20:01:21.3657007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3659725Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3661059Z ^ 2025-05-07T20:01:21.3661272Z 2025-05-07T20:01:21.3661718Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.3662375Z 2025-05-07T20:01:21.3663950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.3666676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.3667831Z ^ 2025-05-07T20:01:21.3668188Z 2025-05-07T20:01:21.4766681Z [343/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:21.4789421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4792122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4793272Z ^ 2025-05-07T20:01:21.4793529Z 2025-05-07T20:01:21.4793967Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.4794639Z 2025-05-07T20:01:21.4796140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4798632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4799826Z ^ 2025-05-07T20:01:21.4800196Z 2025-05-07T20:01:21.4801722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4804638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4805731Z ^ 2025-05-07T20:01:21.4805951Z 2025-05-07T20:01:21.4806355Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.4807013Z 2025-05-07T20:01:21.4808605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4810859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4811847Z ^ 2025-05-07T20:01:21.4812171Z 2025-05-07T20:01:21.4813536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4815914Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4817074Z ^ 2025-05-07T20:01:21.4817330Z 2025-05-07T20:01:21.4817800Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.4818463Z 2025-05-07T20:01:21.4820749Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4823379Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4824456Z ^ 2025-05-07T20:01:21.4824811Z 2025-05-07T20:01:21.4826587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4829330Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4830473Z ^ 2025-05-07T20:01:21.4830703Z 2025-05-07T20:01:21.4831118Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:21.4831753Z 2025-05-07T20:01:21.4833368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4835917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4837061Z ^ 2025-05-07T20:01:21.4837418Z 2025-05-07T20:01:21.4839257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4842007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4843214Z ^ 2025-05-07T20:01:21.4843458Z 2025-05-07T20:01:21.4843913Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.4844705Z 2025-05-07T20:01:21.4846350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.4849029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.4850234Z ^ 2025-05-07T20:01:21.4850599Z 2025-05-07T20:01:21.7602525Z [344/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T20:01:21.7627843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7630439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7631583Z ^ 2025-05-07T20:01:21.7631824Z 2025-05-07T20:01:21.7632286Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.7632956Z 2025-05-07T20:01:21.7634672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7637784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7638953Z ^ 2025-05-07T20:01:21.7639343Z 2025-05-07T20:01:21.7641012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7643647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7644813Z ^ 2025-05-07T20:01:21.7645072Z 
2025-05-07T20:01:21.7645514Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.7646174Z 2025-05-07T20:01:21.7647870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7650638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7651861Z ^ 2025-05-07T20:01:21.7652230Z 2025-05-07T20:01:21.7653956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7656849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7672980Z ^ 2025-05-07T20:01:21.7673275Z 2025-05-07T20:01:21.7673719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.7674539Z 2025-05-07T20:01:21.7676826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7679540Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7680813Z ^ 2025-05-07T20:01:21.7681199Z 2025-05-07T20:01:21.7682844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7685486Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7686620Z ^ 2025-05-07T20:01:21.7686871Z 2025-05-07T20:01:21.7687345Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.7687931Z 2025-05-07T20:01:21.7689652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7692383Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7693618Z ^ 2025-05-07T20:01:21.7693992Z 2025-05-07T20:01:21.7695704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7698656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7699820Z ^ 2025-05-07T20:01:21.7700215Z 2025-05-07T20:01:21.7700607Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:21.7701247Z 2025-05-07T20:01:21.7702854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:21.7705443Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:21.7706658Z ^ 2025-05-07T20:01:21.7707024Z 2025-05-07T20:01:22.7961162Z [345/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:22.7982549Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:23.7050780Z [346/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T20:01:23.7071623Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:24.8351835Z [347/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:24.8375785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8378997Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8380306Z ^ 2025-05-07T20:01:24.8380585Z 2025-05-07T20:01:24.8381044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:24.8381725Z 2025-05-07T20:01:24.8383422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8386233Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8387510Z ^ 2025-05-07T20:01:24.8387878Z 2025-05-07T20:01:24.8389717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8392875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8394110Z ^ 2025-05-07T20:01:24.8394367Z 2025-05-07T20:01:24.8394820Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:24.8395512Z 2025-05-07T20:01:24.8397304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8400013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8401359Z ^ 2025-05-07T20:01:24.8401723Z 2025-05-07T20:01:24.8403181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8405930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8407097Z ^ 2025-05-07T20:01:24.8407359Z 2025-05-07T20:01:24.8407847Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:24.8408481Z 2025-05-07T20:01:24.8410236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8412450Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8413613Z ^ 2025-05-07T20:01:24.8413945Z 2025-05-07T20:01:24.8415372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8418121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8419311Z ^ 2025-05-07T20:01:24.8419535Z 2025-05-07T20:01:24.8419978Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:24.8420762Z 2025-05-07T20:01:24.8422326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8425045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8426257Z ^ 
2025-05-07T20:01:24.8426770Z 2025-05-07T20:01:24.8428550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8431278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8432478Z ^ 2025-05-07T20:01:24.8432737Z 2025-05-07T20:01:24.8433196Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:24.8433872Z 2025-05-07T20:01:24.8435724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:24.8438493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:24.8439743Z ^ 2025-05-07T20:01:24.8440129Z 2025-05-07T20:01:27.4762581Z [348/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:27.4782086Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:28.5118538Z [349/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T20:01:28.5138004Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:29.7904432Z [350/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:29.7925029Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:31.1550668Z [351/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T20:01:31.1570031Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:32.3017999Z [352/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:32.3042373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3045115Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:32.3046424Z ^ 2025-05-07T20:01:32.3046708Z 2025-05-07T20:01:32.3047158Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:32.3047823Z 2025-05-07T20:01:32.3049540Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3052215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3053386Z ^ 2025-05-07T20:01:32.3053744Z 2025-05-07T20:01:32.3055341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3058186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3059346Z ^ 2025-05-07T20:01:32.3059599Z 2025-05-07T20:01:32.3060210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:32.3060918Z 2025-05-07T20:01:32.3062503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3065389Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3066634Z ^ 2025-05-07T20:01:32.3067043Z 2025-05-07T20:01:32.3068481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:01:32.3071156Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3072305Z ^ 2025-05-07T20:01:32.3072568Z 2025-05-07T20:01:32.3073000Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:32.3073655Z 2025-05-07T20:01:32.3075325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3078324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3079504Z ^ 2025-05-07T20:01:32.3079981Z 2025-05-07T20:01:32.3081543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3084254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3085384Z ^ 2025-05-07T20:01:32.3085631Z 2025-05-07T20:01:32.3086037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:32.3086638Z 2025-05-07T20:01:32.3088359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3090860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3092093Z ^ 2025-05-07T20:01:32.3092457Z 2025-05-07T20:01:32.3093926Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3096427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3097561Z ^ 2025-05-07T20:01:32.3097828Z 2025-05-07T20:01:32.3098258Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:32.3098916Z 2025-05-07T20:01:32.3100745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:32.3103487Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:32.3104674Z ^ 2025-05-07T20:01:32.3105011Z 2025-05-07T20:01:32.4937007Z [353/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T20:01:32.4958024Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:33.1410409Z [354/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:33.1434903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1437615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1438733Z ^ 2025-05-07T20:01:33.1438972Z 2025-05-07T20:01:33.1439390Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.1440023Z 2025-05-07T20:01:33.1441698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1444748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1445951Z ^ 2025-05-07T20:01:33.1446327Z 2025-05-07T20:01:33.1447875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1450724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1451919Z ^ 2025-05-07T20:01:33.1452182Z 2025-05-07T20:01:33.1452653Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.1453351Z 
2025-05-07T20:01:33.1455032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1457778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1458967Z ^ 2025-05-07T20:01:33.1459320Z 2025-05-07T20:01:33.1461049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1463817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1465042Z ^ 2025-05-07T20:01:33.1465290Z 2025-05-07T20:01:33.1465676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.1466370Z 2025-05-07T20:01:33.1468064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1470686Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1471961Z ^ 2025-05-07T20:01:33.1472344Z 2025-05-07T20:01:33.1474037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1477062Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:33.1478258Z ^ 2025-05-07T20:01:33.1478511Z 2025-05-07T20:01:33.1478986Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.1479663Z 2025-05-07T20:01:33.1481315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1484045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1485246Z ^ 2025-05-07T20:01:33.1485620Z 2025-05-07T20:01:33.1487233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1490036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1491187Z ^ 2025-05-07T20:01:33.1491457Z 2025-05-07T20:01:33.1491858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:33.1492462Z 2025-05-07T20:01:33.1494163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:33.1496976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:33.1498238Z ^ 2025-05-07T20:01:33.1498719Z 2025-05-07T20:01:39.9927402Z [355/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T20:01:39.9949778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.9952499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9953686Z ^ 2025-05-07T20:01:39.9953953Z 2025-05-07T20:01:39.9954406Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.9955059Z 2025-05-07T20:01:39.9957041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.9959744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9960932Z ^ 2025-05-07T20:01:39.9961300Z 2025-05-07T20:01:39.9963074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.9965557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9966790Z ^ 
2025-05-07T20:01:39.9967050Z 2025-05-07T20:01:39.9967482Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.9968104Z 2025-05-07T20:01:39.9969663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.9972299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9973399Z ^ 2025-05-07T20:01:39.9973762Z 2025-05-07T20:01:39.9975369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.9978363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9979562Z ^ 2025-05-07T20:01:39.9979816Z 2025-05-07T20:01:39.9980342Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.9980987Z 2025-05-07T20:01:39.9982836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.9985572Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9986623Z ^ 2025-05-07T20:01:39.9986977Z 2025-05-07T20:01:39.9988437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:39.9990873Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9991887Z ^ 2025-05-07T20:01:39.9992153Z 2025-05-07T20:01:39.9992582Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:39.9993253Z 2025-05-07T20:01:39.9994873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:39.9997509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:39.9998713Z ^ 2025-05-07T20:01:39.9999074Z 2025-05-07T20:01:40.0000949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.0003563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.0004718Z ^ 2025-05-07T20:01:40.0004961Z 2025-05-07T20:01:40.0005404Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:40.0006074Z 2025-05-07T20:01:40.0007835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:40.0010623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:40.0011744Z ^ 2025-05-07T20:01:40.0012129Z 2025-05-07T20:01:41.3751098Z [356/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T20:01:41.3774003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3777444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3778613Z ^ 2025-05-07T20:01:41.3778852Z 2025-05-07T20:01:41.3779259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.3779940Z 2025-05-07T20:01:41.3781780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3784452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3785639Z ^ 2025-05-07T20:01:41.3786111Z 2025-05-07T20:01:41.3787690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:01:41.3790439Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3791562Z ^ 2025-05-07T20:01:41.3791813Z 2025-05-07T20:01:41.3792256Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.3792872Z 2025-05-07T20:01:41.3794536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3797333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3798511Z ^ 2025-05-07T20:01:41.3798837Z 2025-05-07T20:01:41.3800358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3802879Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3804127Z ^ 2025-05-07T20:01:41.3804392Z 2025-05-07T20:01:41.3804798Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.3805378Z 2025-05-07T20:01:41.3806988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3809605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3810839Z ^ 2025-05-07T20:01:41.3811206Z 2025-05-07T20:01:41.3812901Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3815628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3816831Z ^ 2025-05-07T20:01:41.3817092Z 2025-05-07T20:01:41.3817547Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.3818236Z 2025-05-07T20:01:41.3819942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3823178Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3824390Z ^ 2025-05-07T20:01:41.3824783Z 2025-05-07T20:01:41.3826528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3829334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3830512Z ^ 2025-05-07T20:01:41.3830772Z 2025-05-07T20:01:41.3831214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.3831982Z 2025-05-07T20:01:41.3833736Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.3836441Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.3837680Z ^ 2025-05-07T20:01:41.3838053Z 2025-05-07T20:01:41.4746019Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T20:01:41.4769990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4772698Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4773784Z ^ 2025-05-07T20:01:41.4774042Z 2025-05-07T20:01:41.4774624Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.4775225Z 2025-05-07T20:01:41.4777091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4779990Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4781090Z ^ 2025-05-07T20:01:41.4781406Z 2025-05-07T20:01:41.4783049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4785825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4786967Z ^ 2025-05-07T20:01:41.4787201Z 2025-05-07T20:01:41.4787623Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.4788272Z 2025-05-07T20:01:41.4790079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4792618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4793752Z ^ 2025-05-07T20:01:41.4794249Z 2025-05-07T20:01:41.4795936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4798694Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4799899Z ^ 2025-05-07T20:01:41.4800145Z 2025-05-07T20:01:41.4800599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.4801235Z 2025-05-07T20:01:41.4802908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4805532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4806666Z ^ 2025-05-07T20:01:41.4807027Z 2025-05-07T20:01:41.4808527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4811070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4812226Z ^ 2025-05-07T20:01:41.4812501Z 2025-05-07T20:01:41.4812959Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.4813829Z 2025-05-07T20:01:41.4815544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4818417Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4819662Z ^ 2025-05-07T20:01:41.4820190Z 2025-05-07T20:01:41.4822122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4825131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4826356Z ^ 2025-05-07T20:01:41.4826624Z 2025-05-07T20:01:41.4827063Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:01:41.4827718Z 2025-05-07T20:01:41.4829430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.4832308Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.4833532Z ^ 2025-05-07T20:01:41.4833927Z 2025-05-07T20:01:41.9055258Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 
-gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T20:01:41.9079656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9082492Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9084008Z ^ 2025-05-07T20:01:41.9084267Z 2025-05-07T20:01:41.9084725Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.9085424Z 2025-05-07T20:01:41.9087150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9090029Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9091213Z ^ 2025-05-07T20:01:41.9091584Z 2025-05-07T20:01:41.9093271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9095949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9097073Z ^ 2025-05-07T20:01:41.9097335Z 2025-05-07T20:01:41.9097778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.9098441Z 2025-05-07T20:01:41.9100176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9102959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9104139Z ^ 2025-05-07T20:01:41.9104495Z 2025-05-07T20:01:41.9106039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9108670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9109875Z ^ 2025-05-07T20:01:41.9110135Z 2025-05-07T20:01:41.9110585Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:41.9111294Z 2025-05-07T20:01:41.9112998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9115983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9117111Z ^ 2025-05-07T20:01:41.9117467Z 2025-05-07T20:01:41.9119145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9122019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9123229Z ^ 2025-05-07T20:01:41.9123474Z 2025-05-07T20:01:41.9123926Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.9124605Z 2025-05-07T20:01:41.9126362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9129030Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9130327Z ^ 2025-05-07T20:01:41.9130701Z 2025-05-07T20:01:41.9132320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9135043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9136182Z ^ 2025-05-07T20:01:41.9136428Z 2025-05-07T20:01:41.9136863Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:41.9137486Z 2025-05-07T20:01:41.9139177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:41.9142080Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:41.9143292Z ^ 2025-05-07T20:01:41.9143683Z 2025-05-07T20:01:43.7631199Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T20:01:43.7654438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7658751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7659841Z ^ 2025-05-07T20:01:43.7660222Z 2025-05-07T20:01:43.7660635Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:01:43.7661186Z 2025-05-07T20:01:43.7662719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7665143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7666275Z ^ 2025-05-07T20:01:43.7666641Z 2025-05-07T20:01:43.7668175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7670539Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7671838Z ^ 2025-05-07T20:01:43.7672108Z 2025-05-07T20:01:43.7672547Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:43.7673290Z 2025-05-07T20:01:43.7675043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7677904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7679099Z ^ 2025-05-07T20:01:43.7679459Z 2025-05-07T20:01:43.7681118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7683655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7684809Z ^ 2025-05-07T20:01:43.7685059Z 2025-05-07T20:01:43.7685506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:43.7686192Z 2025-05-07T20:01:43.7687848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7690472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7691799Z ^ 2025-05-07T20:01:43.7692186Z 2025-05-07T20:01:43.7693794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7696384Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7697665Z ^ 2025-05-07T20:01:43.7697944Z 2025-05-07T20:01:43.7698381Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:43.7699042Z 2025-05-07T20:01:43.7700957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7703429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7704648Z ^ 2025-05-07T20:01:43.7704973Z 2025-05-07T20:01:43.7706361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7708854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7709995Z ^ 2025-05-07T20:01:43.7710413Z 2025-05-07T20:01:43.7710858Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:43.7711535Z 2025-05-07T20:01:43.7713179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:43.7715845Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:43.7717216Z ^ 2025-05-07T20:01:43.7717590Z 2025-05-07T20:01:45.5897222Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:45.5921963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5924959Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5926275Z ^ 2025-05-07T20:01:45.5926483Z 2025-05-07T20:01:45.5926938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.5927667Z 2025-05-07T20:01:45.5929455Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5932356Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5933658Z ^ 2025-05-07T20:01:45.5934063Z 2025-05-07T20:01:45.5935907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5939058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5940534Z ^ 2025-05-07T20:01:45.5940821Z 2025-05-07T20:01:45.5941251Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.5941961Z 2025-05-07T20:01:45.5943762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5946659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5947972Z ^ 2025-05-07T20:01:45.5948371Z 2025-05-07T20:01:45.5950109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5952949Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5954394Z ^ 2025-05-07T20:01:45.5954699Z 2025-05-07T20:01:45.5955106Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.5955792Z 2025-05-07T20:01:45.5957812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5960692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5962018Z ^ 2025-05-07T20:01:45.5962417Z 2025-05-07T20:01:45.5964270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5967207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5968492Z ^ 2025-05-07T20:01:45.5968778Z 2025-05-07T20:01:45.5969284Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.5969810Z 2025-05-07T20:01:45.5971367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5974066Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5975323Z ^ 
2025-05-07T20:01:45.5975744Z 2025-05-07T20:01:45.5977627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5980578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5981758Z ^ 2025-05-07T20:01:45.5982061Z 2025-05-07T20:01:45.5982544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:45.5983187Z 2025-05-07T20:01:45.5985190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:45.5987941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:45.5989151Z ^ 2025-05-07T20:01:45.5989563Z 2025-05-07T20:01:54.8051934Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T20:01:54.8075296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first 
declaration 2025-05-07T20:01:54.8077926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8078858Z ^ 2025-05-07T20:01:54.8079100Z 2025-05-07T20:01:54.8079430Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.8080045Z 2025-05-07T20:01:54.8081473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8083856Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8085244Z ^ 2025-05-07T20:01:54.8085580Z 2025-05-07T20:01:54.8087104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8089497Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8090574Z ^ 2025-05-07T20:01:54.8090814Z 2025-05-07T20:01:54.8091200Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.8091779Z 2025-05-07T20:01:54.8093354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8095893Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8096955Z ^ 2025-05-07T20:01:54.8097339Z 2025-05-07T20:01:54.8098980Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8101820Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8103188Z ^ 2025-05-07T20:01:54.8103463Z 2025-05-07T20:01:54.8103911Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.8104577Z 2025-05-07T20:01:54.8106270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8109087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8110298Z ^ 2025-05-07T20:01:54.8110664Z 2025-05-07T20:01:54.8112396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8115081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8116245Z ^ 2025-05-07T20:01:54.8116491Z 2025-05-07T20:01:54.8116939Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.8117613Z 2025-05-07T20:01:54.8119282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8121869Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8122865Z ^ 2025-05-07T20:01:54.8123205Z 2025-05-07T20:01:54.8124645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8126868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8128031Z ^ 2025-05-07T20:01:54.8128263Z 2025-05-07T20:01:54.8128676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:54.8129255Z 2025-05-07T20:01:54.8130764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:54.8133246Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:54.8134346Z ^ 2025-05-07T20:01:54.8134663Z 2025-05-07T20:01:58.3354189Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T20:01:58.3375718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3379751Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3380971Z ^ 2025-05-07T20:01:58.3381248Z 2025-05-07T20:01:58.3381658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:58.3382290Z 2025-05-07T20:01:58.3383828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3386779Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3387920Z ^ 2025-05-07T20:01:58.3388292Z 2025-05-07T20:01:58.3389864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3392335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3393408Z ^ 2025-05-07T20:01:58.3393654Z 2025-05-07T20:01:58.3394068Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:58.3394689Z 2025-05-07T20:01:58.3396258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3398744Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:01:58.3399886Z ^ 2025-05-07T20:01:58.3400249Z 2025-05-07T20:01:58.3402034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3404573Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3405724Z ^ 2025-05-07T20:01:58.3405968Z 2025-05-07T20:01:58.3406388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:58.3407065Z 2025-05-07T20:01:58.3408906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3411536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3412657Z ^ 2025-05-07T20:01:58.3413017Z 2025-05-07T20:01:58.3414586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3417102Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3418185Z ^ 2025-05-07T20:01:58.3418408Z 2025-05-07T20:01:58.3418831Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:58.3419425Z 2025-05-07T20:01:58.3421108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:01:58.3423605Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3424740Z ^ 2025-05-07T20:01:58.3425085Z 2025-05-07T20:01:58.3426761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3429391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3430530Z ^ 2025-05-07T20:01:58.3430768Z 2025-05-07T20:01:58.3431160Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:58.3431781Z 2025-05-07T20:01:58.3433273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:58.3435689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:58.3436768Z ^ 2025-05-07T20:01:58.3437124Z 2025-05-07T20:01:59.9206558Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:59.9229137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9231645Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9233024Z ^ 2025-05-07T20:01:59.9233288Z 2025-05-07T20:01:59.9233707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9234307Z 2025-05-07T20:01:59.9235858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9238179Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9239068Z ^ 2025-05-07T20:01:59.9239357Z 2025-05-07T20:01:59.9240792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9242947Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9243832Z ^ 2025-05-07T20:01:59.9244086Z 2025-05-07T20:01:59.9244512Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9245132Z 2025-05-07T20:01:59.9246640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9249140Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9250107Z ^ 2025-05-07T20:01:59.9250427Z 2025-05-07T20:01:59.9251836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9254414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9255687Z ^ 2025-05-07T20:01:59.9255944Z 2025-05-07T20:01:59.9256361Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9257115Z 2025-05-07T20:01:59.9258696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9261506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9262683Z ^ 2025-05-07T20:01:59.9263052Z 2025-05-07T20:01:59.9264631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9267242Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9268337Z ^ 2025-05-07T20:01:59.9268597Z 2025-05-07T20:01:59.9269004Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:01:59.9269753Z 2025-05-07T20:01:59.9271253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9273765Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9274847Z ^ 2025-05-07T20:01:59.9275165Z 2025-05-07T20:01:59.9276918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9279106Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9280068Z ^ 2025-05-07T20:01:59.9280276Z 2025-05-07T20:01:59.9280655Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:01:59.9281221Z 2025-05-07T20:01:59.9282522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:59.9284772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:01:59.9285909Z ^ 2025-05-07T20:01:59.9286256Z 2025-05-07T20:02:07.3725459Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T20:02:07.3749409Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3752070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3753202Z ^ 2025-05-07T20:02:07.3753435Z 2025-05-07T20:02:07.3753868Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:07.3754521Z 2025-05-07T20:02:07.3756083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3758642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3759775Z ^ 2025-05-07T20:02:07.3760138Z 2025-05-07T20:02:07.3761739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3764287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3765335Z ^ 2025-05-07T20:02:07.3765592Z 2025-05-07T20:02:07.3766032Z 
Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:07.3766672Z 2025-05-07T20:02:07.3768387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3770400Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3771313Z ^ 2025-05-07T20:02:07.3771636Z 2025-05-07T20:02:07.3773057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3775483Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3776988Z ^ 2025-05-07T20:02:07.3777259Z 2025-05-07T20:02:07.3777652Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:07.3778276Z 2025-05-07T20:02:07.3779850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3782386Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3783506Z ^ 2025-05-07T20:02:07.3783834Z 2025-05-07T20:02:07.3785326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3788058Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3789279Z ^ 2025-05-07T20:02:07.3789539Z 2025-05-07T20:02:07.3789994Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:07.3790918Z 2025-05-07T20:02:07.3792604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3795320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3796527Z ^ 2025-05-07T20:02:07.3796933Z 2025-05-07T20:02:07.3798588Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3801310Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3802498Z ^ 2025-05-07T20:02:07.3802771Z 2025-05-07T20:02:07.3803254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:07.3804041Z 2025-05-07T20:02:07.3805771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:07.3808607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:07.3809816Z ^ 2025-05-07T20:02:07.3810190Z 2025-05-07T20:02:11.0119579Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:11.0144319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0146994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0148188Z ^ 2025-05-07T20:02:11.0148489Z 2025-05-07T20:02:11.0148948Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.0149634Z 2025-05-07T20:02:11.0151290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0153824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0155074Z ^ 2025-05-07T20:02:11.0155465Z 2025-05-07T20:02:11.0157341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0159875Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0161021Z ^ 2025-05-07T20:02:11.0161288Z 2025-05-07T20:02:11.0161725Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.0162395Z 2025-05-07T20:02:11.0164219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0166803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0168110Z ^ 2025-05-07T20:02:11.0168466Z 2025-05-07T20:02:11.0170278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0172454Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0173379Z ^ 2025-05-07T20:02:11.0173587Z 2025-05-07T20:02:11.0173941Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.0174425Z 2025-05-07T20:02:11.0175645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0178008Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0178947Z ^ 2025-05-07T20:02:11.0179221Z 2025-05-07T20:02:11.0180547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0182905Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0183797Z ^ 2025-05-07T20:02:11.0184002Z 2025-05-07T20:02:11.0184349Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.0184901Z 2025-05-07T20:02:11.0186367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0188770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0189825Z ^ 2025-05-07T20:02:11.0190150Z 2025-05-07T20:02:11.0191574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0193970Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0195049Z ^ 2025-05-07T20:02:11.0195279Z 2025-05-07T20:02:11.0195693Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.0196279Z 2025-05-07T20:02:11.0197871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.0200199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.0201350Z ^ 
2025-05-07T20:02:11.0201675Z 2025-05-07T20:02:11.9761347Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:11.9784550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9787359Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9788574Z ^ 2025-05-07T20:02:11.9788840Z 2025-05-07T20:02:11.9789309Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.9790337Z 2025-05-07T20:02:11.9792006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9795054Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9796195Z ^ 2025-05-07T20:02:11.9796544Z 2025-05-07T20:02:11.9798080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9800623Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9801873Z ^ 2025-05-07T20:02:11.9802139Z 2025-05-07T20:02:11.9802574Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.9803304Z 2025-05-07T20:02:11.9804941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9807599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9808779Z ^ 2025-05-07T20:02:11.9809143Z 2025-05-07T20:02:11.9810357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9812599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9813653Z ^ 2025-05-07T20:02:11.9813895Z 2025-05-07T20:02:11.9814312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.9814941Z 2025-05-07T20:02:11.9816475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9819105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9820597Z ^ 
2025-05-07T20:02:11.9820963Z 2025-05-07T20:02:11.9822648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9825410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9826644Z ^ 2025-05-07T20:02:11.9826913Z 2025-05-07T20:02:11.9827570Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.9828194Z 2025-05-07T20:02:11.9829661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9832165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9833299Z ^ 2025-05-07T20:02:11.9833666Z 2025-05-07T20:02:11.9835337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:11.9838107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9839490Z ^ 2025-05-07T20:02:11.9839763Z 2025-05-07T20:02:11.9840222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:11.9840915Z 2025-05-07T20:02:11.9842659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:11.9845815Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:11.9847011Z ^ 2025-05-07T20:02:11.9847389Z 2025-05-07T20:02:19.0864921Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T20:02:19.0887982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0890627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0891789Z ^ 2025-05-07T20:02:19.0892033Z 2025-05-07T20:02:19.0892460Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.0893086Z 2025-05-07T20:02:19.0894897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0897155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0898282Z ^ 2025-05-07T20:02:19.0898633Z 2025-05-07T20:02:19.0900518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0903098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0904429Z ^ 2025-05-07T20:02:19.0904694Z 2025-05-07T20:02:19.0905102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.0905700Z 2025-05-07T20:02:19.0907254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0909805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0911056Z ^ 2025-05-07T20:02:19.0911411Z 2025-05-07T20:02:19.0912943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0915278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0916299Z ^ 2025-05-07T20:02:19.0916563Z 2025-05-07T20:02:19.0916993Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.0917633Z 2025-05-07T20:02:19.0919257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0922028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0923164Z ^ 2025-05-07T20:02:19.0923534Z 2025-05-07T20:02:19.0924929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0927401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0928574Z ^ 2025-05-07T20:02:19.0928863Z 2025-05-07T20:02:19.0929280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.0929901Z 2025-05-07T20:02:19.0931487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0934031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0935250Z ^ 2025-05-07T20:02:19.0935618Z 2025-05-07T20:02:19.0937346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0940245Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0941418Z ^ 2025-05-07T20:02:19.0941670Z 2025-05-07T20:02:19.0942087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.0942774Z 2025-05-07T20:02:19.0944461Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.0946908Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.0948134Z ^ 2025-05-07T20:02:19.0948520Z 2025-05-07T20:02:19.5853007Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 
-gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T20:02:19.5876713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5879690Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5880854Z ^ 2025-05-07T20:02:19.5881121Z 2025-05-07T20:02:19.5881557Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5882215Z 2025-05-07T20:02:19.5883883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:02:19.5888885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5890254Z ^ 2025-05-07T20:02:19.5890803Z 2025-05-07T20:02:19.5892378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5894894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5896074Z ^ 2025-05-07T20:02:19.5896350Z 2025-05-07T20:02:19.5896826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5897520Z 2025-05-07T20:02:19.5899044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5901811Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5903027Z ^ 2025-05-07T20:02:19.5903391Z 2025-05-07T20:02:19.5905016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5907565Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5908891Z ^ 2025-05-07T20:02:19.5909157Z 2025-05-07T20:02:19.5909571Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5910230Z 2025-05-07T20:02:19.5911855Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5914459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5915782Z ^ 2025-05-07T20:02:19.5916114Z 2025-05-07T20:02:19.5917645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5920081Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5921237Z ^ 2025-05-07T20:02:19.5921504Z 2025-05-07T20:02:19.5921938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5922603Z 2025-05-07T20:02:19.5924215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5927060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5928351Z ^ 2025-05-07T20:02:19.5928717Z 2025-05-07T20:02:19.5930348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5933165Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5934253Z ^ 
2025-05-07T20:02:19.5934532Z 2025-05-07T20:02:19.5934980Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:19.5935764Z 2025-05-07T20:02:19.5937317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:19.5939919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:19.5941299Z ^ 2025-05-07T20:02:19.5941643Z 2025-05-07T20:02:30.2466793Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:30.2488861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2491410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2492509Z ^ 2025-05-07T20:02:30.2492765Z 2025-05-07T20:02:30.2493350Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.2494002Z 2025-05-07T20:02:30.2495493Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2498220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2499378Z ^ 2025-05-07T20:02:30.2499737Z 2025-05-07T20:02:30.2501607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2504047Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2505086Z ^ 2025-05-07T20:02:30.2505299Z 2025-05-07T20:02:30.2505727Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.2506320Z 2025-05-07T20:02:30.2507573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2509792Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2510778Z ^ 2025-05-07T20:02:30.2511344Z 2025-05-07T20:02:30.2512869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2515363Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2516602Z ^ 
2025-05-07T20:02:30.2516878Z 2025-05-07T20:02:30.2517270Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.2517873Z 2025-05-07T20:02:30.2519400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2521979Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2523090Z ^ 2025-05-07T20:02:30.2523430Z 2025-05-07T20:02:30.2524998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2527414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2528700Z ^ 2025-05-07T20:02:30.2528946Z 2025-05-07T20:02:30.2529524Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.2530187Z 2025-05-07T20:02:30.2531701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2534288Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2535548Z ^ 2025-05-07T20:02:30.2535929Z 2025-05-07T20:02:30.2537472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:30.2539839Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2540992Z ^ 2025-05-07T20:02:30.2541228Z 2025-05-07T20:02:30.2541624Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:30.2542221Z 2025-05-07T20:02:30.2543511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.2545884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:30.2546982Z ^ 2025-05-07T20:02:30.2547286Z 2025-05-07T20:02:32.2311089Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:32.2332091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2334627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2335870Z 
^ 2025-05-07T20:02:32.2336090Z 2025-05-07T20:02:32.2336502Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:32.2337117Z 2025-05-07T20:02:32.2338751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2341264Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2342306Z ^ 2025-05-07T20:02:32.2342613Z 2025-05-07T20:02:32.2344092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2346520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2347578Z ^ 2025-05-07T20:02:32.2347806Z 2025-05-07T20:02:32.2348219Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:32.2348835Z 2025-05-07T20:02:32.2350297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2353021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2354138Z ^ 2025-05-07T20:02:32.2354490Z 2025-05-07T20:02:32.2355975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:32.2358258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2359322Z ^ 2025-05-07T20:02:32.2359569Z 2025-05-07T20:02:32.2359998Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:32.2360567Z 2025-05-07T20:02:32.2361962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2364211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2365285Z ^ 2025-05-07T20:02:32.2365626Z 2025-05-07T20:02:32.2367213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2369343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2370361Z ^ 2025-05-07T20:02:32.2370609Z 2025-05-07T20:02:32.2371008Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:32.2371612Z 2025-05-07T20:02:32.2373182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2375611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2377060Z ^ 2025-05-07T20:02:32.2377357Z 2025-05-07T20:02:32.2378840Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2381378Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2382219Z ^ 2025-05-07T20:02:32.2382400Z 2025-05-07T20:02:32.2382726Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:32.2383304Z 2025-05-07T20:02:32.2384779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:32.2387095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:32.2388160Z ^ 2025-05-07T20:02:32.2388504Z 2025-05-07T20:02:40.0171573Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:02:40.0196298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0198957Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0199971Z ^ 2025-05-07T20:02:40.0200243Z 2025-05-07T20:02:40.0200626Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.0201245Z 2025-05-07T20:02:40.0202771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0205208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0206384Z ^ 2025-05-07T20:02:40.0206768Z 2025-05-07T20:02:40.0219350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0222194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0223582Z ^ 2025-05-07T20:02:40.0223834Z 2025-05-07T20:02:40.0224268Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.0224843Z 2025-05-07T20:02:40.0226472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0229187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0230380Z ^ 
2025-05-07T20:02:40.0230741Z 2025-05-07T20:02:40.0232335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0234889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0236043Z ^ 2025-05-07T20:02:40.0236294Z 2025-05-07T20:02:40.0236752Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.0237444Z 2025-05-07T20:02:40.0239137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0242192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0243449Z ^ 2025-05-07T20:02:40.0243834Z 2025-05-07T20:02:40.0245535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0248341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0249547Z ^ 2025-05-07T20:02:40.0249805Z 2025-05-07T20:02:40.0250265Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.0251052Z 2025-05-07T20:02:40.0252693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:02:40.0255342Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0256454Z ^ 2025-05-07T20:02:40.0256828Z 2025-05-07T20:02:40.0258410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0261186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0262401Z ^ 2025-05-07T20:02:40.0262678Z 2025-05-07T20:02:40.0263143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:40.0263840Z 2025-05-07T20:02:40.0265516Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.0268466Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:40.0269652Z ^ 2025-05-07T20:02:40.0270004Z 2025-05-07T20:02:41.8563701Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T20:02:41.8587772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8590518Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8591649Z ^ 2025-05-07T20:02:41.8591897Z 2025-05-07T20:02:41.8592316Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:41.8592955Z 2025-05-07T20:02:41.8594601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8597322Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8598506Z ^ 2025-05-07T20:02:41.8599142Z 2025-05-07T20:02:41.8600795Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8603499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8604681Z ^ 2025-05-07T20:02:41.8604939Z 2025-05-07T20:02:41.8605386Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:41.8606080Z 2025-05-07T20:02:41.8607677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8610254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8611390Z ^ 2025-05-07T20:02:41.8611779Z 2025-05-07T20:02:41.8613411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8616044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8617234Z ^ 2025-05-07T20:02:41.8617501Z 2025-05-07T20:02:41.8618179Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:41.8618838Z 2025-05-07T20:02:41.8620718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8623778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8625149Z ^ 2025-05-07T20:02:41.8625536Z 2025-05-07T20:02:41.8627271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8630060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8631305Z ^ 2025-05-07T20:02:41.8631520Z 2025-05-07T20:02:41.8631948Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:02:41.8632589Z 2025-05-07T20:02:41.8634179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8636742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8637702Z ^ 2025-05-07T20:02:41.8638051Z 2025-05-07T20:02:41.8639646Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8642343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8643491Z ^ 2025-05-07T20:02:41.8643894Z 2025-05-07T20:02:41.8644354Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:41.8645033Z 2025-05-07T20:02:41.8646615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:41.8649275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:41.8650462Z ^ 2025-05-07T20:02:41.8650842Z 2025-05-07T20:02:42.6258721Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T20:02:42.6281684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6284457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6285795Z ^ 2025-05-07T20:02:42.6286063Z 2025-05-07T20:02:42.6286540Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6287203Z 2025-05-07T20:02:42.6288866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6291995Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6293123Z ^ 2025-05-07T20:02:42.6293477Z 2025-05-07T20:02:42.6295078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6297624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6298766Z ^ 2025-05-07T20:02:42.6299013Z 
2025-05-07T20:02:42.6299449Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6300251Z 2025-05-07T20:02:42.6302161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6304817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6305972Z ^ 2025-05-07T20:02:42.6306293Z 2025-05-07T20:02:42.6307987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6310637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6311800Z ^ 2025-05-07T20:02:42.6312058Z 2025-05-07T20:02:42.6312525Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6313206Z 2025-05-07T20:02:42.6315040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6317800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6319106Z ^ 2025-05-07T20:02:42.6319490Z 2025-05-07T20:02:42.6321056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6323797Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6324954Z ^ 2025-05-07T20:02:42.6325208Z 2025-05-07T20:02:42.6325632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6326298Z 2025-05-07T20:02:42.6327830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6330568Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6331780Z ^ 2025-05-07T20:02:42.6332152Z 2025-05-07T20:02:42.6333864Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6336708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6337887Z ^ 2025-05-07T20:02:42.6338127Z 2025-05-07T20:02:42.6338537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:42.6339210Z 2025-05-07T20:02:42.6341020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:42.6343783Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:42.6344926Z ^ 2025-05-07T20:02:42.6345297Z 2025-05-07T20:02:52.4022467Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T20:02:52.4043857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4046306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4047796Z ^ 2025-05-07T20:02:52.4048027Z 2025-05-07T20:02:52.4048414Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4048928Z 2025-05-07T20:02:52.4050213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4052635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4053694Z ^ 2025-05-07T20:02:52.4054081Z 2025-05-07T20:02:52.4055625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4058116Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4059178Z ^ 2025-05-07T20:02:52.4059422Z 2025-05-07T20:02:52.4059865Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4060644Z 2025-05-07T20:02:52.4062171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4064801Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4065936Z ^ 2025-05-07T20:02:52.4066251Z 2025-05-07T20:02:52.4067778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4070431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4071495Z ^ 2025-05-07T20:02:52.4071725Z 2025-05-07T20:02:52.4072136Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4072895Z 2025-05-07T20:02:52.4074441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4077276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4078424Z ^ 2025-05-07T20:02:52.4078786Z 2025-05-07T20:02:52.4080342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4082876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4084003Z ^ 2025-05-07T20:02:52.4084257Z 2025-05-07T20:02:52.4084695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4085332Z 2025-05-07T20:02:52.4086783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4089530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4090558Z ^ 2025-05-07T20:02:52.4090907Z 2025-05-07T20:02:52.4092441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4094909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:52.4096008Z ^ 2025-05-07T20:02:52.4096271Z 2025-05-07T20:02:52.4096708Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:52.4097291Z 2025-05-07T20:02:52.4098866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:52.4101560Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:02:52.4102895Z ^ 2025-05-07T20:02:52.4103259Z 2025-05-07T20:02:59.5607003Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG 
-std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T20:02:59.5630128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5632732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5633884Z ^ 2025-05-07T20:02:59.5634140Z 2025-05-07T20:02:59.5634553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:59.5635143Z 2025-05-07T20:02:59.5636630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5639199Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5640329Z ^ 2025-05-07T20:02:59.5640726Z 2025-05-07T20:02:59.5642445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5645214Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5646375Z ^ 2025-05-07T20:02:59.5646613Z 2025-05-07T20:02:59.5646974Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:59.5647795Z 2025-05-07T20:02:59.5649285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5651829Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5652934Z ^ 2025-05-07T20:02:59.5653422Z 2025-05-07T20:02:59.5655012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5657591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5658724Z ^ 2025-05-07T20:02:59.5658959Z 2025-05-07T20:02:59.5659378Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:59.5660051Z 2025-05-07T20:02:59.5661876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5664536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:02:59.5665680Z ^ 2025-05-07T20:02:59.5666046Z 2025-05-07T20:02:59.5667586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5670205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5671354Z ^ 2025-05-07T20:02:59.5671591Z 2025-05-07T20:02:59.5672023Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:59.5672828Z 2025-05-07T20:02:59.5674378Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5677202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5678300Z ^ 2025-05-07T20:02:59.5678632Z 2025-05-07T20:02:59.5680225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:59.5682837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5684079Z ^ 2025-05-07T20:02:59.5684336Z 2025-05-07T20:02:59.5684791Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:02:59.5685495Z 2025-05-07T20:02:59.5687194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:02:59.5689604Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:02:59.5690632Z ^ 2025-05-07T20:02:59.5690978Z 2025-05-07T20:03:03.4925277Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T20:03:03.4948379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4950830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4951940Z ^ 2025-05-07T20:03:03.4952200Z 2025-05-07T20:03:03.4952669Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.4953250Z 2025-05-07T20:03:03.4954857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:03.4957280Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4958328Z ^ 2025-05-07T20:03:03.4958644Z 2025-05-07T20:03:03.4960125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4962255Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4963211Z ^ 2025-05-07T20:03:03.4963430Z 2025-05-07T20:03:03.4963778Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.4964299Z 2025-05-07T20:03:03.4965858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4968098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4969296Z ^ 2025-05-07T20:03:03.4969632Z 2025-05-07T20:03:03.4971170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4973636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4974685Z ^ 2025-05-07T20:03:03.4974911Z 2025-05-07T20:03:03.4975342Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.4976265Z 2025-05-07T20:03:03.4977777Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4980424Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4981574Z ^ 2025-05-07T20:03:03.4981910Z 2025-05-07T20:03:03.4983459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4986147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4987236Z ^ 2025-05-07T20:03:03.4987491Z 2025-05-07T20:03:03.4987914Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.4988534Z 2025-05-07T20:03:03.4990104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4992636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4993790Z ^ 2025-05-07T20:03:03.4994145Z 2025-05-07T20:03:03.4995720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.4998215Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.4999349Z ^ 
2025-05-07T20:03:03.4999579Z 2025-05-07T20:03:03.4999980Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:03.5000603Z 2025-05-07T20:03:03.5002315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:03.5004823Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:03.5005898Z ^ 2025-05-07T20:03:03.5006248Z 2025-05-07T20:03:10.0596713Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:03:10.0618674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0621170Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0622201Z ^ 2025-05-07T20:03:10.0622439Z 2025-05-07T20:03:10.0622826Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:10.0623372Z 2025-05-07T20:03:10.0624747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0627463Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0628427Z ^ 2025-05-07T20:03:10.0628729Z 2025-05-07T20:03:10.0630131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0632715Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0633732Z ^ 2025-05-07T20:03:10.0633991Z 2025-05-07T20:03:10.0634409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.0635112Z 2025-05-07T20:03:10.0636590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0638967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0640064Z ^ 2025-05-07T20:03:10.0640405Z 2025-05-07T20:03:10.0641899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0644161Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0645354Z ^ 2025-05-07T20:03:10.0645592Z 2025-05-07T20:03:10.0646183Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.0646827Z 2025-05-07T20:03:10.0648399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0651069Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0652193Z ^ 2025-05-07T20:03:10.0652556Z 2025-05-07T20:03:10.0654049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0656535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0657573Z ^ 2025-05-07T20:03:10.0657822Z 2025-05-07T20:03:10.0658234Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.0658854Z 2025-05-07T20:03:10.0660459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0662901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0664001Z ^ 2025-05-07T20:03:10.0664310Z 2025-05-07T20:03:10.0665735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0668098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0669077Z ^ 2025-05-07T20:03:10.0669287Z 2025-05-07T20:03:10.0669657Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:10.0670260Z 2025-05-07T20:03:10.0671675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:10.0674032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:10.0674991Z ^ 2025-05-07T20:03:10.0675402Z 2025-05-07T20:03:13.7349979Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:13.7362700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7364116Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7364758Z ^ 2025-05-07T20:03:13.7364904Z 2025-05-07T20:03:13.7365150Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.7365648Z 2025-05-07T20:03:13.7366535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7367960Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7368598Z ^ 2025-05-07T20:03:13.7368816Z 2025-05-07T20:03:13.7369753Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7371237Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7371863Z ^ 2025-05-07T20:03:13.7372025Z 2025-05-07T20:03:13.7372272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.7372627Z 2025-05-07T20:03:13.7373510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7374942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7375587Z ^ 2025-05-07T20:03:13.7375790Z 2025-05-07T20:03:13.7376905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7378324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7378958Z ^ 2025-05-07T20:03:13.7379099Z 2025-05-07T20:03:13.7379340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.7379867Z 2025-05-07T20:03:13.7380897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7382328Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7382972Z ^ 2025-05-07T20:03:13.7383176Z 2025-05-07T20:03:13.7384062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7385465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7386106Z ^ 2025-05-07T20:03:13.7386248Z 2025-05-07T20:03:13.7386500Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.7386856Z 2025-05-07T20:03:13.7387730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7389146Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7389791Z ^ 
2025-05-07T20:03:13.7389991Z 2025-05-07T20:03:13.7390925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7392341Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7392962Z ^ 2025-05-07T20:03:13.7393119Z 2025-05-07T20:03:13.7393419Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:13.7393774Z 2025-05-07T20:03:13.7394660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:13.7396116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:13.7396767Z ^ 2025-05-07T20:03:13.7396971Z 2025-05-07T20:03:14.4355206Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:14.4368053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4369478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4370109Z ^ 2025-05-07T20:03:14.4370295Z 2025-05-07T20:03:14.4370548Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.4370906Z 2025-05-07T20:03:14.4371869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4373269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4374007Z ^ 2025-05-07T20:03:14.4374217Z 2025-05-07T20:03:14.4375099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4376782Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4377446Z ^ 2025-05-07T20:03:14.4377596Z 2025-05-07T20:03:14.4377846Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.4378236Z 2025-05-07T20:03:14.4379102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4380689Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:14.4381332Z ^ 2025-05-07T20:03:14.4381562Z 2025-05-07T20:03:14.4382414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4383918Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4384544Z ^ 2025-05-07T20:03:14.4384725Z 2025-05-07T20:03:14.4384972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.4385330Z 2025-05-07T20:03:14.4386194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4387626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4388289Z ^ 2025-05-07T20:03:14.4388497Z 2025-05-07T20:03:14.4389348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4390752Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4391407Z ^ 2025-05-07T20:03:14.4391553Z 2025-05-07T20:03:14.4391797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.4392186Z 2025-05-07T20:03:14.4393109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:14.4394530Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4395176Z ^ 2025-05-07T20:03:14.4395383Z 2025-05-07T20:03:14.4396314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4397696Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4398416Z ^ 2025-05-07T20:03:14.4398568Z 2025-05-07T20:03:14.4398843Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.4399198Z 2025-05-07T20:03:14.4400066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.4401479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.4402142Z ^ 2025-05-07T20:03:14.4402350Z 2025-05-07T20:03:14.7693144Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:03:14.7716689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7719758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7720955Z ^ 2025-05-07T20:03:14.7721233Z 2025-05-07T20:03:14.7721689Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.7722472Z 2025-05-07T20:03:14.7724226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7727164Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7728418Z ^ 2025-05-07T20:03:14.7728803Z 2025-05-07T20:03:14.7730584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7733180Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7734319Z ^ 2025-05-07T20:03:14.7734575Z 2025-05-07T20:03:14.7735010Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.7735687Z 2025-05-07T20:03:14.7737336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7739935Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7741292Z ^ 2025-05-07T20:03:14.7741617Z 2025-05-07T20:03:14.7743016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7745697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7746793Z ^ 2025-05-07T20:03:14.7747040Z 2025-05-07T20:03:14.7747414Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.7748013Z 2025-05-07T20:03:14.7749606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7752150Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7753345Z ^ 2025-05-07T20:03:14.7753718Z 2025-05-07T20:03:14.7755425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7758139Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7759248Z ^ 2025-05-07T20:03:14.7759496Z 2025-05-07T20:03:14.7759946Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:03:14.7760641Z 2025-05-07T20:03:14.7762366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7765362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7766629Z ^ 2025-05-07T20:03:14.7767080Z 2025-05-07T20:03:14.7768770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7771535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7772743Z ^ 2025-05-07T20:03:14.7773033Z 2025-05-07T20:03:14.7773499Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:14.7774199Z 2025-05-07T20:03:14.7776202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:14.7779077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:14.7780402Z ^ 2025-05-07T20:03:14.7780766Z 2025-05-07T20:03:16.3921647Z [381/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:03:16.3945390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3948063Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3949267Z ^ 2025-05-07T20:03:16.3949518Z 2025-05-07T20:03:16.3949972Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.3950646Z 2025-05-07T20:03:16.3952299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3955201Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3956481Z ^ 2025-05-07T20:03:16.3956888Z 2025-05-07T20:03:16.3958651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3961490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3962559Z ^ 2025-05-07T20:03:16.3962808Z 2025-05-07T20:03:16.3963241Z Remark: The 
warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.3964091Z 2025-05-07T20:03:16.3965762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3968358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3969566Z ^ 2025-05-07T20:03:16.3969939Z 2025-05-07T20:03:16.3971613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3974171Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3975277Z ^ 2025-05-07T20:03:16.3975550Z 2025-05-07T20:03:16.3976340Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.3977058Z 2025-05-07T20:03:16.3978725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3981504Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3982738Z ^ 2025-05-07T20:03:16.3983117Z 2025-05-07T20:03:16.3985066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3987837Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3988986Z ^ 2025-05-07T20:03:16.3989242Z 2025-05-07T20:03:16.3989697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.3990550Z 2025-05-07T20:03:16.3992382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.3995186Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.3996421Z ^ 2025-05-07T20:03:16.3996793Z 2025-05-07T20:03:16.3998471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4001167Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4002385Z ^ 2025-05-07T20:03:16.4002645Z 2025-05-07T20:03:16.4003088Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:16.4003752Z 2025-05-07T20:03:16.4005463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:16.4008226Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:16.4009472Z ^ 2025-05-07T20:03:16.4009848Z 2025-05-07T20:03:18.7546610Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T20:03:18.7572547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7575026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7576721Z ^ 2025-05-07T20:03:18.7576978Z 2025-05-07T20:03:18.7577434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.7578115Z 2025-05-07T20:03:18.7579715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7582278Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7583309Z ^ 2025-05-07T20:03:18.7583654Z 2025-05-07T20:03:18.7585190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7588161Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7589262Z ^ 2025-05-07T20:03:18.7589475Z 2025-05-07T20:03:18.7589874Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.7590414Z 2025-05-07T20:03:18.7591815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7594285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7595381Z ^ 2025-05-07T20:03:18.7595751Z 2025-05-07T20:03:18.7597244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7599849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7600990Z ^ 2025-05-07T20:03:18.7601263Z 2025-05-07T20:03:18.7601715Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.7602356Z 2025-05-07T20:03:18.7604418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7607160Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7608379Z ^ 2025-05-07T20:03:18.7608732Z 2025-05-07T20:03:18.7610589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7613332Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7614622Z ^ 2025-05-07T20:03:18.7614884Z 2025-05-07T20:03:18.7615343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.7616048Z 2025-05-07T20:03:18.7617770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7620714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7621840Z ^ 2025-05-07T20:03:18.7622236Z 2025-05-07T20:03:18.7623991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7626557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7627857Z ^ 2025-05-07T20:03:18.7628138Z 2025-05-07T20:03:18.7628536Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:18.7629135Z 2025-05-07T20:03:18.7630667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:18.7633500Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:18.7634670Z ^ 
2025-05-07T20:03:18.7635045Z 2025-05-07T20:03:21.9908445Z [383/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:21.9931730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9934362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9935504Z ^ 2025-05-07T20:03:21.9935770Z 2025-05-07T20:03:21.9936215Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:21.9936884Z 2025-05-07T20:03:21.9938499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9941315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9942763Z ^ 2025-05-07T20:03:21.9943134Z 2025-05-07T20:03:21.9944702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9947254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9948369Z ^ 2025-05-07T20:03:21.9948639Z 2025-05-07T20:03:21.9949073Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:21.9949695Z 2025-05-07T20:03:21.9951349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9953965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9955057Z ^ 2025-05-07T20:03:21.9955402Z 2025-05-07T20:03:21.9956965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9959472Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9960635Z ^ 2025-05-07T20:03:21.9961066Z 2025-05-07T20:03:21.9961695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:21.9962328Z 2025-05-07T20:03:21.9963908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9966509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9967645Z ^ 
2025-05-07T20:03:21.9968030Z 2025-05-07T20:03:21.9969506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9972155Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9973300Z ^ 2025-05-07T20:03:21.9973535Z 2025-05-07T20:03:21.9973991Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:21.9974645Z 2025-05-07T20:03:21.9976712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9979352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9980636Z ^ 2025-05-07T20:03:21.9981000Z 2025-05-07T20:03:21.9982611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:21.9985183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9986567Z ^ 2025-05-07T20:03:21.9986822Z 2025-05-07T20:03:21.9987250Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:21.9987869Z 2025-05-07T20:03:21.9989366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:21.9991886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:21.9992948Z ^ 2025-05-07T20:03:21.9993266Z 2025-05-07T20:03:22.9849414Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:22.9872243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9875012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9876510Z ^ 2025-05-07T20:03:22.9876811Z 2025-05-07T20:03:22.9877264Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:22.9877916Z 2025-05-07T20:03:22.9879459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9882253Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9883378Z ^ 2025-05-07T20:03:22.9883736Z 2025-05-07T20:03:22.9885390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9887933Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9889076Z ^ 2025-05-07T20:03:22.9889343Z 2025-05-07T20:03:22.9889794Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:22.9890472Z 2025-05-07T20:03:22.9892112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9894724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9895870Z ^ 2025-05-07T20:03:22.9896251Z 2025-05-07T20:03:22.9898113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9900771Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9901879Z ^ 2025-05-07T20:03:22.9902156Z 2025-05-07T20:03:22.9902589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:22.9903226Z 2025-05-07T20:03:22.9905009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9907583Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9908629Z ^ 2025-05-07T20:03:22.9908932Z 2025-05-07T20:03:22.9910413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9912884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9914009Z ^ 2025-05-07T20:03:22.9914265Z 2025-05-07T20:03:22.9914706Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:22.9915357Z 2025-05-07T20:03:22.9916997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9919610Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9920766Z ^ 2025-05-07T20:03:22.9921155Z 2025-05-07T20:03:22.9922770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9925637Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9926814Z ^ 2025-05-07T20:03:22.9927099Z 2025-05-07T20:03:22.9927539Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:03:22.9928194Z 2025-05-07T20:03:22.9929525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:22.9931870Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:22.9933001Z ^ 2025-05-07T20:03:22.9933361Z 2025-05-07T20:03:23.3945675Z [385/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T20:03:23.3970232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3972972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3974323Z ^ 2025-05-07T20:03:23.3974592Z 2025-05-07T20:03:23.3975046Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.3975726Z 2025-05-07T20:03:23.3977714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3980409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3981607Z ^ 2025-05-07T20:03:23.3981975Z 2025-05-07T20:03:23.3983597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3986027Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3987215Z ^ 2025-05-07T20:03:23.3987462Z 2025-05-07T20:03:23.3987907Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.3988534Z 2025-05-07T20:03:23.3990200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3993070Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3994388Z ^ 2025-05-07T20:03:23.3994777Z 2025-05-07T20:03:23.3996433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.3998877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.3999822Z ^ 2025-05-07T20:03:23.4000028Z 2025-05-07T20:03:23.4000546Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:03:23.4001119Z 2025-05-07T20:03:23.4002664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.4005276Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.4006458Z ^ 2025-05-07T20:03:23.4006817Z 2025-05-07T20:03:23.4008430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.4010875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.4011827Z ^ 2025-05-07T20:03:23.4012091Z 2025-05-07T20:03:23.4012485Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.4013043Z 2025-05-07T20:03:23.4014511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.4017075Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.4018263Z ^ 2025-05-07T20:03:23.4018633Z 2025-05-07T20:03:23.4020418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.4022946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.4024070Z ^ 2025-05-07T20:03:23.4024321Z 2025-05-07T20:03:23.4024697Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:23.4025313Z 2025-05-07T20:03:23.4026783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:23.4029238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:23.4030380Z ^ 2025-05-07T20:03:23.4030756Z 2025-05-07T20:03:24.0989852Z [386/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T20:03:24.1010132Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:25.7575444Z [387/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:03:25.7595862Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:28.7496387Z [388/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:03:28.7516133Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:41.8729644Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:41.8750312Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8752388Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8753418Z ^ 2025-05-07T20:03:41.8753646Z 2025-05-07T20:03:41.8754050Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:41.8754626Z 2025-05-07T20:03:41.8756088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8758747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8759803Z ^ 2025-05-07T20:03:41.8760125Z 2025-05-07T20:03:41.8761573Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8763904Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8764924Z ^ 2025-05-07T20:03:41.8765148Z 2025-05-07T20:03:41.8765545Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:41.8766137Z 2025-05-07T20:03:41.8767562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8769929Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8770877Z ^ 2025-05-07T20:03:41.8771206Z 2025-05-07T20:03:41.8772493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8774953Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8776363Z ^ 2025-05-07T20:03:41.8776627Z 2025-05-07T20:03:41.8777081Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:41.8777724Z 2025-05-07T20:03:41.8779227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8781992Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8782931Z ^ 2025-05-07T20:03:41.8783335Z 2025-05-07T20:03:41.8784748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8786963Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8788073Z ^ 2025-05-07T20:03:41.8788305Z 2025-05-07T20:03:41.8788721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:41.8789279Z 2025-05-07T20:03:41.8790437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8792445Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8793543Z ^ 2025-05-07T20:03:41.8793862Z 2025-05-07T20:03:41.8795372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8797923Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8799051Z ^ 2025-05-07T20:03:41.8799258Z 2025-05-07T20:03:41.8799652Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:41.8800250Z 2025-05-07T20:03:41.8801514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:41.8803772Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:41.8804892Z ^ 2025-05-07T20:03:41.8805245Z 2025-05-07T20:03:43.6782079Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:43.6805226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6807797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6808941Z ^ 2025-05-07T20:03:43.6809447Z 2025-05-07T20:03:43.6809922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.6810586Z 2025-05-07T20:03:43.6812287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:43.6815023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6816236Z ^ 2025-05-07T20:03:43.6816636Z 2025-05-07T20:03:43.6818314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6821172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6822363Z ^ 2025-05-07T20:03:43.6822649Z 2025-05-07T20:03:43.6823096Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.6823763Z 2025-05-07T20:03:43.6825412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6827985Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6829177Z ^ 2025-05-07T20:03:43.6829706Z 2025-05-07T20:03:43.6831347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6833958Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6835368Z ^ 2025-05-07T20:03:43.6835628Z 2025-05-07T20:03:43.6836148Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.6836751Z 2025-05-07T20:03:43.6838182Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6840798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6841743Z ^ 2025-05-07T20:03:43.6842051Z 2025-05-07T20:03:43.6843362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6845665Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6846559Z ^ 2025-05-07T20:03:43.6846821Z 2025-05-07T20:03:43.6847167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.6847711Z 2025-05-07T20:03:43.6849132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6851465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6852519Z ^ 2025-05-07T20:03:43.6852839Z 2025-05-07T20:03:43.6854179Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6856846Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6857802Z ^ 
2025-05-07T20:03:43.6858013Z 2025-05-07T20:03:43.6858432Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:43.6859036Z 2025-05-07T20:03:43.6860551Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:43.6862743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:43.6863844Z ^ 2025-05-07T20:03:43.6864229Z 2025-05-07T20:03:45.4536340Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:45.4560087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4563358Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4564594Z ^ 2025-05-07T20:03:45.4564861Z 2025-05-07T20:03:45.4565313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.4565959Z 2025-05-07T20:03:45.4567479Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4570223Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4571443Z ^ 2025-05-07T20:03:45.4571822Z 2025-05-07T20:03:45.4573508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4576734Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4577755Z ^ 2025-05-07T20:03:45.4578003Z 2025-05-07T20:03:45.4578383Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.4578947Z 2025-05-07T20:03:45.4580491Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4582625Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4583561Z ^ 2025-05-07T20:03:45.4583853Z 2025-05-07T20:03:45.4585355Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4587337Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4588385Z ^ 
2025-05-07T20:03:45.4588586Z 2025-05-07T20:03:45.4588953Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.4589510Z 2025-05-07T20:03:45.4590786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4593015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4593955Z ^ 2025-05-07T20:03:45.4594294Z 2025-05-07T20:03:45.4595611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4597739Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4598626Z ^ 2025-05-07T20:03:45.4598879Z 2025-05-07T20:03:45.4599249Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.4599773Z 2025-05-07T20:03:45.4601081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4603509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4604607Z ^ 2025-05-07T20:03:45.4604947Z 2025-05-07T20:03:45.4606645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:45.4609147Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4610242Z ^ 2025-05-07T20:03:45.4610493Z 2025-05-07T20:03:45.4610889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.4611499Z 2025-05-07T20:03:45.4612956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.4615320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.4616472Z ^ 2025-05-07T20:03:45.4616859Z 2025-05-07T20:03:45.6236240Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:45.6260123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6263017Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:45.6264206Z ^ 2025-05-07T20:03:45.6264472Z 2025-05-07T20:03:45.6264936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.6265631Z 2025-05-07T20:03:45.6267271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6269860Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6271008Z ^ 2025-05-07T20:03:45.6271372Z 2025-05-07T20:03:45.6272719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6275524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6277030Z ^ 2025-05-07T20:03:45.6277288Z 2025-05-07T20:03:45.6277773Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.6278497Z 2025-05-07T20:03:45.6280182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6283159Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6284407Z ^ 2025-05-07T20:03:45.6284887Z 2025-05-07T20:03:45.6286561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:45.6289275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6290471Z ^ 2025-05-07T20:03:45.6290760Z 2025-05-07T20:03:45.6291254Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.6291924Z 2025-05-07T20:03:45.6293567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6296254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6297502Z ^ 2025-05-07T20:03:45.6297870Z 2025-05-07T20:03:45.6299538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6302377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6303707Z ^ 2025-05-07T20:03:45.6303969Z 2025-05-07T20:03:45.6304413Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.6305090Z 2025-05-07T20:03:45.6306863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6309533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6310738Z ^ 2025-05-07T20:03:45.6311119Z 2025-05-07T20:03:45.6312805Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6315495Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6316650Z ^ 2025-05-07T20:03:45.6316902Z 2025-05-07T20:03:45.6317330Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:45.6318002Z 2025-05-07T20:03:45.6319636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:45.6322484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:45.6323655Z ^ 2025-05-07T20:03:45.6324038Z 2025-05-07T20:03:46.5671801Z [393/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:03:46.5691789Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:03:47.2889060Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:47.2911404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2914077Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2915218Z ^ 2025-05-07T20:03:47.2915469Z 2025-05-07T20:03:47.2915900Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.2916543Z 2025-05-07T20:03:47.2918154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2920738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2922172Z ^ 2025-05-07T20:03:47.2922480Z 2025-05-07T20:03:47.2924040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2926408Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2927461Z ^ 2025-05-07T20:03:47.2927697Z 2025-05-07T20:03:47.2928119Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.2928718Z 
2025-05-07T20:03:47.2930206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2932670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2933741Z ^ 2025-05-07T20:03:47.2934098Z 2025-05-07T20:03:47.2935560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2938182Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2939372Z ^ 2025-05-07T20:03:47.2939617Z 2025-05-07T20:03:47.2941299Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.2941958Z 2025-05-07T20:03:47.2943461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2946003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2947293Z ^ 2025-05-07T20:03:47.2947635Z 2025-05-07T20:03:47.2949128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2951679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:47.2952802Z ^ 2025-05-07T20:03:47.2953036Z 2025-05-07T20:03:47.2953450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.2954073Z 2025-05-07T20:03:47.2955663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2958158Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2959209Z ^ 2025-05-07T20:03:47.2959558Z 2025-05-07T20:03:47.2961045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2963554Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2964574Z ^ 2025-05-07T20:03:47.2964976Z 2025-05-07T20:03:47.2965403Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:47.2966026Z 2025-05-07T20:03:47.2967641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:47.2970200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:47.2971282Z ^ 2025-05-07T20:03:47.2971569Z 2025-05-07T20:03:48.5174233Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:48.5197175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5199798Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5200966Z ^ 2025-05-07T20:03:48.5201218Z 2025-05-07T20:03:48.5201680Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.5202321Z 2025-05-07T20:03:48.5203943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5206826Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5207975Z ^ 2025-05-07T20:03:48.5208320Z 2025-05-07T20:03:48.5209874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5212281Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:48.5213290Z ^ 2025-05-07T20:03:48.5213530Z 2025-05-07T20:03:48.5213922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.5214550Z 2025-05-07T20:03:48.5216048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5218406Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5219494Z ^ 2025-05-07T20:03:48.5219850Z 2025-05-07T20:03:48.5221534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5224294Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5225443Z ^ 2025-05-07T20:03:48.5225709Z 2025-05-07T20:03:48.5226157Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.5226816Z 2025-05-07T20:03:48.5228489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5242183Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5243737Z ^ 2025-05-07T20:03:48.5244091Z 2025-05-07T20:03:48.5245665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:48.5248346Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5249591Z ^ 2025-05-07T20:03:48.5249810Z 2025-05-07T20:03:48.5250214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.5250799Z 2025-05-07T20:03:48.5252242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5254737Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5255865Z ^ 2025-05-07T20:03:48.5256224Z 2025-05-07T20:03:48.5257735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5260578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5261448Z ^ 2025-05-07T20:03:48.5261681Z 2025-05-07T20:03:48.5262104Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:48.5262781Z 2025-05-07T20:03:48.5264250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:48.5266745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:48.5267884Z ^ 2025-05-07T20:03:48.5268230Z 2025-05-07T20:03:50.7671745Z [396/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:50.7695911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7698525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7700029Z ^ 2025-05-07T20:03:50.7700483Z 2025-05-07T20:03:50.7700927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.7701596Z 2025-05-07T20:03:50.7703193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7705885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7707056Z ^ 2025-05-07T20:03:50.7707424Z 2025-05-07T20:03:50.7709062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T20:03:50.7711666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7712871Z ^ 2025-05-07T20:03:50.7713135Z 2025-05-07T20:03:50.7713599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.7714251Z 2025-05-07T20:03:50.7715967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7718791Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7720138Z ^ 2025-05-07T20:03:50.7720516Z 2025-05-07T20:03:50.7722155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7724749Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7726065Z ^ 2025-05-07T20:03:50.7726314Z 2025-05-07T20:03:50.7726731Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.7727528Z 2025-05-07T20:03:50.7729211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7731900Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7733079Z ^ 2025-05-07T20:03:50.7733447Z 2025-05-07T20:03:50.7735091Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7737626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7738774Z ^ 2025-05-07T20:03:50.7739036Z 2025-05-07T20:03:50.7739498Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.7740136Z 2025-05-07T20:03:50.7741859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7744438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7745754Z ^ 2025-05-07T20:03:50.7746106Z 2025-05-07T20:03:50.7747684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7750218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7751339Z ^ 2025-05-07T20:03:50.7751624Z 2025-05-07T20:03:50.7752068Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:50.7752700Z 2025-05-07T20:03:50.7754242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:50.7756753Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:50.7757947Z ^ 2025-05-07T20:03:50.7758321Z 2025-05-07T20:03:53.7774586Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:03:53.7797972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7800869Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7802225Z ^ 2025-05-07T20:03:53.7802520Z 2025-05-07T20:03:53.7803027Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7803801Z 2025-05-07T20:03:53.7805670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:53.7808734Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7810109Z ^ 2025-05-07T20:03:53.7810551Z 2025-05-07T20:03:53.7812424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7815458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7816812Z ^ 2025-05-07T20:03:53.7817105Z 2025-05-07T20:03:53.7817625Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7818368Z 2025-05-07T20:03:53.7820797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7823895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7825271Z ^ 2025-05-07T20:03:53.7825693Z 2025-05-07T20:03:53.7827696Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7830825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7832190Z ^ 2025-05-07T20:03:53.7832608Z 2025-05-07T20:03:53.7833103Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7834020Z 2025-05-07T20:03:53.7835858Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7838824Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7840158Z ^ 2025-05-07T20:03:53.7840565Z 2025-05-07T20:03:53.7842394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7845323Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7846648Z ^ 2025-05-07T20:03:53.7846932Z 2025-05-07T20:03:53.7847442Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7848377Z 2025-05-07T20:03:53.7850207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7853351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7854720Z ^ 2025-05-07T20:03:53.7855140Z 2025-05-07T20:03:53.7856985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7860003Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7861471Z ^ 
2025-05-07T20:03:53.7861782Z 2025-05-07T20:03:53.7862285Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.7863039Z 2025-05-07T20:03:53.7864949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.7867969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.7869347Z ^ 2025-05-07T20:03:53.7869760Z 2025-05-07T20:03:53.8698849Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:53.8720524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8723095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8724237Z ^ 2025-05-07T20:03:53.8724517Z 2025-05-07T20:03:53.8725019Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.8725671Z 2025-05-07T20:03:53.8727316Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8729930Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8731089Z ^ 2025-05-07T20:03:53.8731448Z 2025-05-07T20:03:53.8733080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8735912Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8737024Z ^ 2025-05-07T20:03:53.8737273Z 2025-05-07T20:03:53.8737691Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.8738355Z 2025-05-07T20:03:53.8740023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8742702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8743815Z ^ 2025-05-07T20:03:53.8744168Z 2025-05-07T20:03:53.8745626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8748033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8749067Z ^ 
2025-05-07T20:03:53.8749305Z 2025-05-07T20:03:53.8749696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.8750257Z 2025-05-07T20:03:53.8751665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8754079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8755190Z ^ 2025-05-07T20:03:53.8755566Z 2025-05-07T20:03:53.8757246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8759972Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8761102Z ^ 2025-05-07T20:03:53.8761347Z 2025-05-07T20:03:53.8761763Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.8762408Z 2025-05-07T20:03:53.8763948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8766668Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8767840Z ^ 2025-05-07T20:03:53.8768243Z 2025-05-07T20:03:53.8769954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:03:53.8772812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8773957Z ^ 2025-05-07T20:03:53.8774207Z 2025-05-07T20:03:53.8774658Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:53.8775316Z 2025-05-07T20:03:53.8777343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:53.8780419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:53.8781584Z ^ 2025-05-07T20:03:53.8781952Z 2025-05-07T20:03:57.7862871Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:03:57.7884067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7886566Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7887651Z ^ 2025-05-07T20:03:57.7887884Z 
2025-05-07T20:03:57.7888269Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.7888846Z 2025-05-07T20:03:57.7890365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7892919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7894099Z ^ 2025-05-07T20:03:57.7894454Z 2025-05-07T20:03:57.7896306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7898936Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7900138Z ^ 2025-05-07T20:03:57.7900542Z 2025-05-07T20:03:57.7901167Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.7901779Z 2025-05-07T20:03:57.7903328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7905973Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7907074Z ^ 2025-05-07T20:03:57.7907397Z 2025-05-07T20:03:57.7908945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7911242Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7912233Z ^ 2025-05-07T20:03:57.7912471Z 2025-05-07T20:03:57.7912882Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.7913497Z 2025-05-07T20:03:57.7914924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7917275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7918269Z ^ 2025-05-07T20:03:57.7918803Z 2025-05-07T20:03:57.7920187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7922631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7923650Z ^ 2025-05-07T20:03:57.7923916Z 2025-05-07T20:03:57.7924279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.7924866Z 2025-05-07T20:03:57.7926266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7928697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7929800Z ^ 2025-05-07T20:03:57.7930132Z 2025-05-07T20:03:57.7931610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7933993Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7935091Z ^ 2025-05-07T20:03:57.7935329Z 2025-05-07T20:03:57.7935952Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.7936578Z 2025-05-07T20:03:57.7937978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.7940643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.7941893Z ^ 2025-05-07T20:03:57.7942275Z 2025-05-07T20:03:57.8874147Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:03:57.8898781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8901379Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8902550Z ^ 2025-05-07T20:03:57.8902784Z 2025-05-07T20:03:57.8903168Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.8903794Z 2025-05-07T20:03:57.8905822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8908368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8909491Z ^ 2025-05-07T20:03:57.8909932Z 2025-05-07T20:03:57.8911520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8914419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8915784Z ^ 2025-05-07T20:03:57.8916169Z 2025-05-07T20:03:57.8916598Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.8917234Z 2025-05-07T20:03:57.8918884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8921659Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8922909Z ^ 2025-05-07T20:03:57.8923304Z 2025-05-07T20:03:57.8925036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8927411Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8928483Z ^ 2025-05-07T20:03:57.8928727Z 2025-05-07T20:03:57.8929165Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.8929819Z 2025-05-07T20:03:57.8931545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8934272Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8935596Z ^ 2025-05-07T20:03:57.8935925Z 2025-05-07T20:03:57.8937466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8940015Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8941248Z ^ 2025-05-07T20:03:57.8941473Z 2025-05-07T20:03:57.8941874Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.8942516Z 2025-05-07T20:03:57.8944094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8946628Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:03:57.8947727Z ^ 2025-05-07T20:03:57.8948067Z 2025-05-07T20:03:57.8949640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8952478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8953615Z ^ 2025-05-07T20:03:57.8953886Z 2025-05-07T20:03:57.8954347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:57.8955011Z 2025-05-07T20:03:57.8956818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:57.8959410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:57.8960676Z ^ 2025-05-07T20:03:57.8961048Z 2025-05-07T20:03:58.3346690Z [401/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:03:58.3370199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ 
annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3373005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3374216Z ^ 2025-05-07T20:03:58.3374480Z 2025-05-07T20:03:58.3375321Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:58.3376347Z 2025-05-07T20:03:58.3378062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3380906Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3382023Z ^ 2025-05-07T20:03:58.3382372Z 2025-05-07T20:03:58.3383829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3386692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3387881Z ^ 2025-05-07T20:03:58.3388145Z 2025-05-07T20:03:58.3388601Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:58.3389271Z 2025-05-07T20:03:58.3390957Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3393618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:03:58.3394823Z ^ 2025-05-07T20:03:58.3395217Z 2025-05-07T20:03:58.3396886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3399627Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3401029Z ^ 2025-05-07T20:03:58.3401271Z 2025-05-07T20:03:58.3401724Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:58.3402388Z 2025-05-07T20:03:58.3404112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3406894Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3408118Z ^ 2025-05-07T20:03:58.3408503Z 2025-05-07T20:03:58.3409988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3412520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3413664Z ^ 2025-05-07T20:03:58.3413939Z 2025-05-07T20:03:58.3414371Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:58.3415005Z 2025-05-07T20:03:58.3416690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:03:58.3419136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3420666Z ^ 2025-05-07T20:03:58.3421030Z 2025-05-07T20:03:58.3422550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3425099Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3426322Z ^ 2025-05-07T20:03:58.3426554Z 2025-05-07T20:03:58.3426962Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:03:58.3427590Z 2025-05-07T20:03:58.3429193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:03:58.3431945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:03:58.3433122Z ^ 2025-05-07T20:03:58.3433509Z 2025-05-07T20:04:02.6624668Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:02.6647335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6650044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6651192Z ^ 2025-05-07T20:04:02.6651463Z 2025-05-07T20:04:02.6651892Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.6652507Z 2025-05-07T20:04:02.6654307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6657091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6658236Z ^ 2025-05-07T20:04:02.6658591Z 2025-05-07T20:04:02.6660222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6662855Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6664008Z ^ 2025-05-07T20:04:02.6664252Z 2025-05-07T20:04:02.6664687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.6665320Z 2025-05-07T20:04:02.6666919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6669653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6670957Z ^ 2025-05-07T20:04:02.6671331Z 2025-05-07T20:04:02.6672937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6675748Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6677225Z ^ 2025-05-07T20:04:02.6677493Z 2025-05-07T20:04:02.6677910Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.6678577Z 2025-05-07T20:04:02.6680266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6682888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6684094Z ^ 2025-05-07T20:04:02.6684449Z 2025-05-07T20:04:02.6686091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6688723Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6689877Z ^ 2025-05-07T20:04:02.6690139Z 2025-05-07T20:04:02.6690592Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:02.6691245Z 2025-05-07T20:04:02.6693079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6695677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6696826Z ^ 2025-05-07T20:04:02.6697216Z 2025-05-07T20:04:02.6698974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6701693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6702975Z ^ 2025-05-07T20:04:02.6703246Z 2025-05-07T20:04:02.6703692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:02.6704335Z 2025-05-07T20:04:02.6705991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:02.6708647Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:02.6709787Z ^ 2025-05-07T20:04:02.6710138Z 2025-05-07T20:04:06.3747409Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:06.3769138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3771931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3773061Z ^ 2025-05-07T20:04:06.3773302Z 2025-05-07T20:04:06.3773692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3774436Z 2025-05-07T20:04:06.3776170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3778655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3779796Z ^ 2025-05-07T20:04:06.3780149Z 2025-05-07T20:04:06.3781771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3784222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3785234Z ^ 2025-05-07T20:04:06.3785478Z 2025-05-07T20:04:06.3785865Z Remark: The warnings 
can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3786440Z 2025-05-07T20:04:06.3787902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3790333Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3791728Z ^ 2025-05-07T20:04:06.3792089Z 2025-05-07T20:04:06.3793525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3795994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3797151Z ^ 2025-05-07T20:04:06.3797362Z 2025-05-07T20:04:06.3797711Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3798267Z 2025-05-07T20:04:06.3799660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3802040Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3803138Z ^ 2025-05-07T20:04:06.3803492Z 2025-05-07T20:04:06.3804956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3807720Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3808850Z ^ 2025-05-07T20:04:06.3809119Z 2025-05-07T20:04:06.3809562Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3810181Z 2025-05-07T20:04:06.3811827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3814292Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3815455Z ^ 2025-05-07T20:04:06.3815957Z 2025-05-07T20:04:06.3817461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3819987Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3821260Z ^ 2025-05-07T20:04:06.3821503Z 2025-05-07T20:04:06.3821927Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:06.3822598Z 2025-05-07T20:04:06.3824121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:06.3826465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:06.3827509Z ^ 2025-05-07T20:04:06.3827868Z 2025-05-07T20:04:10.2653117Z [404/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adagrad.cpp 
2025-05-07T20:04:10.2670981Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:11.3381593Z [405/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:04:11.3398398Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:13.1761631Z [406/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:13.1773651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1775043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1775674Z ^ 2025-05-07T20:04:13.1775820Z 2025-05-07T20:04:13.1776367Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.1776737Z 2025-05-07T20:04:13.1777606Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1779011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1779636Z ^ 2025-05-07T20:04:13.1779961Z 2025-05-07T20:04:13.1780907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1782451Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1783060Z ^ 2025-05-07T20:04:13.1783216Z 2025-05-07T20:04:13.1783459Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.1783806Z 2025-05-07T20:04:13.1784727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1786096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1786729Z ^ 2025-05-07T20:04:13.1786926Z 2025-05-07T20:04:13.1787791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1789152Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1789779Z ^ 
2025-05-07T20:04:13.1789918Z 2025-05-07T20:04:13.1790152Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.1790622Z 2025-05-07T20:04:13.1791484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1792868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1793485Z ^ 2025-05-07T20:04:13.1793698Z 2025-05-07T20:04:13.1794603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1796045Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1796653Z ^ 2025-05-07T20:04:13.1796796Z 2025-05-07T20:04:13.1797051Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.1797400Z 2025-05-07T20:04:13.1798258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1799652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1800289Z ^ 2025-05-07T20:04:13.1800487Z 2025-05-07T20:04:13.1801330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:13.1802712Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1803334Z ^ 2025-05-07T20:04:13.1803475Z 2025-05-07T20:04:13.1803719Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:13.1804186Z 2025-05-07T20:04:13.1805051Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:13.1806441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:13.1807073Z ^ 2025-05-07T20:04:13.1807271Z 2025-05-07T20:04:15.6346447Z [407/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:04:15.6365304Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:15.8703277Z [408/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:04:15.8721763Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:17.3101233Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:17.3123917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3126928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3128048Z ^ 2025-05-07T20:04:17.3128322Z 2025-05-07T20:04:17.3128758Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.3129421Z 2025-05-07T20:04:17.3130951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3133319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3134480Z ^ 2025-05-07T20:04:17.3134794Z 2025-05-07T20:04:17.3136294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3138892Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3140032Z ^ 2025-05-07T20:04:17.3140456Z 2025-05-07T20:04:17.3140875Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:04:17.3141511Z 2025-05-07T20:04:17.3143326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3145877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3147038Z ^ 2025-05-07T20:04:17.3147413Z 2025-05-07T20:04:17.3149114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3151705Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3152928Z ^ 2025-05-07T20:04:17.3153176Z 2025-05-07T20:04:17.3153632Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.3154267Z 2025-05-07T20:04:17.3155879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3158441Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3159602Z ^ 2025-05-07T20:04:17.3159965Z 2025-05-07T20:04:17.3161525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3164085Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3165234Z ^ 2025-05-07T20:04:17.3165478Z 2025-05-07T20:04:17.3165919Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.3166552Z 2025-05-07T20:04:17.3168119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3170677Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3171801Z ^ 2025-05-07T20:04:17.3172172Z 2025-05-07T20:04:17.3173959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3176799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3177913Z ^ 2025-05-07T20:04:17.3178122Z 2025-05-07T20:04:17.3178497Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:17.3179028Z 2025-05-07T20:04:17.3180472Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:17.3182697Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:17.3183700Z ^ 2025-05-07T20:04:17.3184016Z 2025-05-07T20:04:18.3237883Z [410/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:04:18.3257835Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:18.7405852Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:18.7428574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7431482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7432606Z ^ 2025-05-07T20:04:18.7432861Z 2025-05-07T20:04:18.7433313Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7433973Z 2025-05-07T20:04:18.7435668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7438322Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7439518Z ^ 2025-05-07T20:04:18.7439880Z 2025-05-07T20:04:18.7441546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7444204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7445498Z ^ 2025-05-07T20:04:18.7445742Z 2025-05-07T20:04:18.7446187Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7446868Z 2025-05-07T20:04:18.7448465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7451175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7452372Z ^ 2025-05-07T20:04:18.7452757Z 2025-05-07T20:04:18.7454446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7457123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7458274Z ^ 2025-05-07T20:04:18.7458540Z 2025-05-07T20:04:18.7458987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7459656Z 2025-05-07T20:04:18.7461505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7464302Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7465502Z ^ 2025-05-07T20:04:18.7465860Z 2025-05-07T20:04:18.7467682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7470462Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7471768Z ^ 2025-05-07T20:04:18.7472031Z 2025-05-07T20:04:18.7472494Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7473267Z 2025-05-07T20:04:18.7475012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7478056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7479275Z ^ 2025-05-07T20:04:18.7479670Z 2025-05-07T20:04:18.7481458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7484137Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7485415Z ^ 2025-05-07T20:04:18.7485666Z 2025-05-07T20:04:18.7486123Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:18.7486792Z 2025-05-07T20:04:18.7488492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7491209Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7492617Z ^ 2025-05-07T20:04:18.7492987Z 2025-05-07T20:04:18.7867745Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:18.7892506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7894655Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7895572Z ^ 2025-05-07T20:04:18.7895752Z 2025-05-07T20:04:18.7896107Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7896656Z 2025-05-07T20:04:18.7898125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7900825Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7902019Z ^ 2025-05-07T20:04:18.7902404Z 2025-05-07T20:04:18.7903920Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7906812Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7907873Z ^ 2025-05-07T20:04:18.7908120Z 2025-05-07T20:04:18.7908541Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7909170Z 2025-05-07T20:04:18.7910686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7913376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7914576Z ^ 2025-05-07T20:04:18.7914938Z 2025-05-07T20:04:18.7916554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7919085Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7920261Z ^ 2025-05-07T20:04:18.7920512Z 2025-05-07T20:04:18.7920949Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:18.7921578Z 2025-05-07T20:04:18.7923192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7925754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7926853Z ^ 2025-05-07T20:04:18.7927225Z 2025-05-07T20:04:18.7928946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7931626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7932626Z ^ 2025-05-07T20:04:18.7932883Z 2025-05-07T20:04:18.7933409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7934043Z 2025-05-07T20:04:18.7935667Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7938506Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7939557Z ^ 2025-05-07T20:04:18.7939886Z 2025-05-07T20:04:18.7941470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7943992Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7944925Z ^ 2025-05-07T20:04:18.7945130Z 2025-05-07T20:04:18.7945502Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:18.7946254Z 2025-05-07T20:04:18.7947678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:18.7950013Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:18.7951141Z ^ 2025-05-07T20:04:18.7951486Z 2025-05-07T20:04:18.8516222Z [413/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:04:18.8534288Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.2226404Z [414/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:04:19.2244166Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.6804888Z [415/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:04:19.6824682Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.7460870Z 
[416/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:04:19.7480980Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:19.7760756Z [417/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:04:19.7779948Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:21.0767034Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:21.0785482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0787585Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:21.0788559Z ^ 2025-05-07T20:04:21.0788746Z 2025-05-07T20:04:21.0789087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:21.0789571Z 2025-05-07T20:04:21.0790895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0793005Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0793928Z ^ 2025-05-07T20:04:21.0794205Z 2025-05-07T20:04:21.0795529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0797621Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0798397Z ^ 2025-05-07T20:04:21.0798573Z 2025-05-07T20:04:21.0798889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:21.0799336Z 2025-05-07T20:04:21.0800433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0802239Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0803099Z ^ 2025-05-07T20:04:21.0803353Z 2025-05-07T20:04:21.0804469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:04:21.0806325Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0807183Z ^ 2025-05-07T20:04:21.0807380Z 2025-05-07T20:04:21.0807742Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:21.0808274Z 2025-05-07T20:04:21.0809637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0811580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0812484Z ^ 2025-05-07T20:04:21.0812758Z 2025-05-07T20:04:21.0814081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0816207Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0817062Z ^ 2025-05-07T20:04:21.0817288Z 2025-05-07T20:04:21.0817634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:21.0818111Z 2025-05-07T20:04:21.0819328Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0821419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0822332Z ^ 2025-05-07T20:04:21.0822621Z 2025-05-07T20:04:21.0823922Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0825891Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0826813Z ^ 2025-05-07T20:04:21.0827023Z 2025-05-07T20:04:21.0827367Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:21.0828016Z 2025-05-07T20:04:21.0829267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:21.0831263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:21.0832143Z ^ 2025-05-07T20:04:21.0832454Z 2025-05-07T20:04:22.6307051Z [419/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:22.6326095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:04:22.6328340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6329324Z ^ 2025-05-07T20:04:22.6329547Z 2025-05-07T20:04:22.6329923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:22.6330609Z 2025-05-07T20:04:22.6332104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6334520Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6335451Z ^ 2025-05-07T20:04:22.6335783Z 2025-05-07T20:04:22.6337040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6339088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6339986Z ^ 2025-05-07T20:04:22.6340205Z 2025-05-07T20:04:22.6340742Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:22.6341268Z 2025-05-07T20:04:22.6342818Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6345193Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6346204Z ^ 2025-05-07T20:04:22.6346510Z 2025-05-07T20:04:22.6347904Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6350277Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6351358Z ^ 2025-05-07T20:04:22.6351584Z 2025-05-07T20:04:22.6352021Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:22.6352658Z 2025-05-07T20:04:22.6354094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6356269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6357456Z ^ 2025-05-07T20:04:22.6357811Z 2025-05-07T20:04:22.6359456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6362064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6363243Z ^ 2025-05-07T20:04:22.6363496Z 2025-05-07T20:04:22.6363950Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:22.6364612Z 2025-05-07T20:04:22.6365934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6368055Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6369016Z ^ 2025-05-07T20:04:22.6369311Z 2025-05-07T20:04:22.6370643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6376357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6377281Z ^ 2025-05-07T20:04:22.6377496Z 2025-05-07T20:04:22.6377875Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:22.6378400Z 2025-05-07T20:04:22.6379853Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:22.6382501Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:22.6383558Z ^ 2025-05-07T20:04:22.6383877Z 2025-05-07T20:04:23.2194035Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:23.2216476Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2219275Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2220652Z ^ 2025-05-07T20:04:23.2220913Z 2025-05-07T20:04:23.2221704Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.2222389Z 2025-05-07T20:04:23.2224163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2226943Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2228174Z ^ 2025-05-07T20:04:23.2228554Z 2025-05-07T20:04:23.2230332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2232731Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2233833Z ^ 2025-05-07T20:04:23.2234078Z 2025-05-07T20:04:23.2234529Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.2235178Z 2025-05-07T20:04:23.2236835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2239478Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2240641Z ^ 2025-05-07T20:04:23.2242841Z 2025-05-07T20:04:23.2244527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2247105Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2248124Z ^ 2025-05-07T20:04:23.2248381Z 2025-05-07T20:04:23.2248931Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.2249691Z 2025-05-07T20:04:23.2251374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2254128Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2255230Z ^ 2025-05-07T20:04:23.2255611Z 2025-05-07T20:04:23.2257101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2259778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2261125Z ^ 2025-05-07T20:04:23.2261382Z 2025-05-07T20:04:23.2261838Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.2262454Z 2025-05-07T20:04:23.2263905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2266510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2267833Z ^ 2025-05-07T20:04:23.2268225Z 2025-05-07T20:04:23.2269814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2272477Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2273659Z ^ 2025-05-07T20:04:23.2273943Z 2025-05-07T20:04:23.2274393Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:23.2275008Z 2025-05-07T20:04:23.2276762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:23.2279516Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:23.2280705Z ^ 2025-05-07T20:04:23.2281059Z 2025-05-07T20:04:24.3963741Z [421/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:24.3984118Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:24.7724100Z [422/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:24.7747916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7750428Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7751478Z ^ 2025-05-07T20:04:24.7751723Z 2025-05-07T20:04:24.7752183Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.7752924Z 2025-05-07T20:04:24.7754406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7757065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7758227Z ^ 2025-05-07T20:04:24.7758562Z 2025-05-07T20:04:24.7760019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7762511Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7763604Z ^ 2025-05-07T20:04:24.7763883Z 2025-05-07T20:04:24.7764311Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.7765174Z 2025-05-07T20:04:24.7766650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7769019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7770075Z ^ 2025-05-07T20:04:24.7770428Z 2025-05-07T20:04:24.7772014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7774307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7775325Z ^ 2025-05-07T20:04:24.7775571Z 2025-05-07T20:04:24.7776312Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.7776918Z 2025-05-07T20:04:24.7778547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7781028Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7782066Z ^ 2025-05-07T20:04:24.7782425Z 2025-05-07T20:04:24.7784163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7786513Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7787587Z ^ 2025-05-07T20:04:24.7787855Z 2025-05-07T20:04:24.7788272Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.7789045Z 2025-05-07T20:04:24.7790560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7793177Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7794298Z ^ 2025-05-07T20:04:24.7794605Z 2025-05-07T20:04:24.7795993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7798191Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7799198Z ^ 2025-05-07T20:04:24.7799456Z 2025-05-07T20:04:24.7799877Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:24.7800669Z 2025-05-07T20:04:24.7802157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:24.7804465Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:24.7805533Z ^ 
2025-05-07T20:04:24.7805883Z 2025-05-07T20:04:25.2558863Z [423/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:04:25.2576938Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.2639231Z [424/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:26.2661578Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:26.5151936Z [425/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:04:26.5176536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5179278Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5180558Z ^ 2025-05-07T20:04:26.5180852Z 2025-05-07T20:04:26.5181303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.5181982Z 2025-05-07T20:04:26.5183838Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5186536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5187775Z ^ 2025-05-07T20:04:26.5188151Z 2025-05-07T20:04:26.5189820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5192532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5193753Z ^ 2025-05-07T20:04:26.5194014Z 2025-05-07T20:04:26.5194475Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.5195180Z 2025-05-07T20:04:26.5196882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5199653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5200867Z ^ 2025-05-07T20:04:26.5201271Z 2025-05-07T20:04:26.5203132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5205868Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5207068Z ^ 2025-05-07T20:04:26.5207335Z 2025-05-07T20:04:26.5207826Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.5208508Z 2025-05-07T20:04:26.5210281Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5213086Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5214333Z ^ 2025-05-07T20:04:26.5214720Z 2025-05-07T20:04:26.5216404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5219116Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5220434Z ^ 2025-05-07T20:04:26.5220697Z 2025-05-07T20:04:26.5221147Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.5221845Z 2025-05-07T20:04:26.5223541Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5226262Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:04:26.5227459Z ^ 2025-05-07T20:04:26.5227829Z 2025-05-07T20:04:26.5229517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5232290Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5233532Z ^ 2025-05-07T20:04:26.5233793Z 2025-05-07T20:04:26.5234269Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:26.5234950Z 2025-05-07T20:04:26.5236664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:26.5239382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:26.5240576Z ^ 2025-05-07T20:04:26.5240936Z 2025-05-07T20:04:27.2238573Z [426/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:27.2257246Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:27.2409701Z [427/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:27.2433683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2436452Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2437676Z ^ 2025-05-07T20:04:27.2437888Z 2025-05-07T20:04:27.2438253Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.2439177Z 2025-05-07T20:04:27.2440729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2443012Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2444051Z ^ 2025-05-07T20:04:27.2444401Z 2025-05-07T20:04:27.2445836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2448331Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2449373Z ^ 2025-05-07T20:04:27.2449618Z 
2025-05-07T20:04:27.2450084Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.2450754Z 2025-05-07T20:04:27.2452341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2454722Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2455775Z ^ 2025-05-07T20:04:27.2456131Z 2025-05-07T20:04:27.2457452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2459732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2460848Z ^ 2025-05-07T20:04:27.2461028Z 2025-05-07T20:04:27.2461385Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.2461903Z 2025-05-07T20:04:27.2463212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2465438Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2466498Z ^ 2025-05-07T20:04:27.2466846Z 2025-05-07T20:04:27.2468166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2470945Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2472002Z ^ 2025-05-07T20:04:27.2472260Z 2025-05-07T20:04:27.2472722Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.2473349Z 2025-05-07T20:04:27.2475087Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2477971Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2479223Z ^ 2025-05-07T20:04:27.2479565Z 2025-05-07T20:04:27.2480972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2483351Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2484368Z ^ 2025-05-07T20:04:27.2484578Z 2025-05-07T20:04:27.2484937Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.2485552Z 2025-05-07T20:04:27.2486991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.2489282Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.2490235Z ^ 2025-05-07T20:04:27.2490562Z 2025-05-07T20:04:27.6429528Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO 
-DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:27.6447490Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:27.6862913Z [429/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:27.6884315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6886859Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6887972Z ^ 2025-05-07T20:04:27.6888216Z 2025-05-07T20:04:27.6888555Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.6889001Z 2025-05-07T20:04:27.6890120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6892714Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6893760Z ^ 2025-05-07T20:04:27.6894071Z 2025-05-07T20:04:27.6895539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6898274Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6899328Z ^ 2025-05-07T20:04:27.6899547Z 2025-05-07T20:04:27.6899925Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.6900801Z 2025-05-07T20:04:27.6902236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6904632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6905646Z ^ 2025-05-07T20:04:27.6905991Z 2025-05-07T20:04:27.6907400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6909732Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6910761Z ^ 2025-05-07T20:04:27.6910999Z 2025-05-07T20:04:27.6911409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.6911991Z 2025-05-07T20:04:27.6913454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6915836Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6917123Z ^ 2025-05-07T20:04:27.6917468Z 2025-05-07T20:04:27.6918893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6921315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6922388Z ^ 2025-05-07T20:04:27.6922581Z 2025-05-07T20:04:27.6922936Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:27.6923484Z 2025-05-07T20:04:27.6924894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6927173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6928250Z ^ 2025-05-07T20:04:27.6928591Z 2025-05-07T20:04:27.6930012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6932419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6933455Z ^ 2025-05-07T20:04:27.6933690Z 2025-05-07T20:04:27.6934100Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:04:27.6934667Z 2025-05-07T20:04:27.6936180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:27.6938671Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:27.6951086Z ^ 2025-05-07T20:04:27.6951640Z 2025-05-07T20:04:27.9244236Z [430/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:04:27.9265720Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.3142569Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:04:28.3161658Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.3668915Z [432/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:04:28.3688624Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T20:04:28.7477008Z [433/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:04:28.7496839Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:28.8588343Z [434/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA 
-DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:28.8606379Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:29.1725985Z [435/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:04:29.1747594Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.1517011Z [436/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:04:30.1534561Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T20:04:30.3425729Z [437/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d 
-o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:04:30.3444913Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.3848504Z [438/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:04:30.3867173Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:30.5392387Z [439/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:30.5414105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5416692Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5417782Z ^ 
2025-05-07T20:04:30.5418062Z 2025-05-07T20:04:30.5418509Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:30.5419114Z 2025-05-07T20:04:30.5420902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5423357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5424533Z ^ 2025-05-07T20:04:30.5424900Z 2025-05-07T20:04:30.5426471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5429087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5430446Z ^ 2025-05-07T20:04:30.5430681Z 2025-05-07T20:04:30.5431102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:30.5431738Z 2025-05-07T20:04:30.5433366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5435861Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5437020Z ^ 2025-05-07T20:04:30.5437376Z 2025-05-07T20:04:30.5438898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:04:30.5441591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5442687Z ^ 2025-05-07T20:04:30.5442951Z 2025-05-07T20:04:30.5443405Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:30.5444041Z 2025-05-07T20:04:30.5445741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5448426Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5449563Z ^ 2025-05-07T20:04:30.5449873Z 2025-05-07T20:04:30.5451495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5454010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5455011Z ^ 2025-05-07T20:04:30.5455244Z 2025-05-07T20:04:30.5455646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:30.5456361Z 2025-05-07T20:04:30.5457883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5460418Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5461405Z ^ 2025-05-07T20:04:30.5461744Z 2025-05-07T20:04:30.5463222Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5465877Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5467017Z ^ 2025-05-07T20:04:30.5467270Z 2025-05-07T20:04:30.5467710Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:30.5468284Z 2025-05-07T20:04:30.5469949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:30.5472422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:30.5473700Z ^ 2025-05-07T20:04:30.5474043Z 2025-05-07T20:04:31.3166935Z [440/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:04:31.3186996Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:31.3955950Z [441/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:31.3980229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.3983268Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.3984445Z ^ 2025-05-07T20:04:31.3984708Z 2025-05-07T20:04:31.3985423Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.3986103Z 2025-05-07T20:04:31.3987779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.3990533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.3991857Z ^ 2025-05-07T20:04:31.3992263Z 2025-05-07T20:04:31.3993837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.3996942Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.3998105Z ^ 2025-05-07T20:04:31.3998356Z 2025-05-07T20:04:31.3998797Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.3999461Z 2025-05-07T20:04:31.4001134Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.4003807Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.4004996Z ^ 2025-05-07T20:04:31.4005357Z 2025-05-07T20:04:31.4007021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.4009563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.4010688Z ^ 2025-05-07T20:04:31.4010933Z 2025-05-07T20:04:31.4011506Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.4012116Z 2025-05-07T20:04:31.4013884Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.4016638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.4018034Z ^ 2025-05-07T20:04:31.4018430Z 2025-05-07T20:04:31.4020166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.4023036Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.4024171Z ^ 
2025-05-07T20:04:31.4024431Z 2025-05-07T20:04:31.4024801Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.4025421Z 2025-05-07T20:04:31.4027009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.4029759Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.4030994Z ^ 2025-05-07T20:04:31.4031509Z 2025-05-07T20:04:31.4033208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.4036098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.4037277Z ^ 2025-05-07T20:04:31.4037519Z 2025-05-07T20:04:31.4038074Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:31.4038753Z 2025-05-07T20:04:31.4040364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:31.4043090Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:31.4044271Z ^ 2025-05-07T20:04:31.4044636Z 2025-05-07T20:04:31.8696347Z [442/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:31.8717209Z clang-16: warning: 
argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:32.5337390Z [443/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:04:32.5357375Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.3111961Z [444/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:04:33.3128243Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.3603691Z [445/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:04:33.3624009Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:33.9269824Z [446/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:04:33.9288575Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:34.7543921Z [447/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:04:34.7563413Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:35.3702189Z [448/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:04:35.3720476Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:36.5688285Z [449/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:04:36.5707766Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:36.6153663Z [450/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:36.6177237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6179754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6180970Z ^ 2025-05-07T20:04:36.6181225Z 2025-05-07T20:04:36.6181665Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:36.6182303Z 2025-05-07T20:04:36.6183869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6186453Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6187588Z ^ 2025-05-07T20:04:36.6188222Z 2025-05-07T20:04:36.6189819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6192422Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6193574Z ^ 2025-05-07T20:04:36.6193823Z 2025-05-07T20:04:36.6194348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:36.6194995Z 
2025-05-07T20:04:36.6196627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6199368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6200452Z ^ 2025-05-07T20:04:36.6200809Z 2025-05-07T20:04:36.6202324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6204790Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6205906Z ^ 2025-05-07T20:04:36.6206143Z 2025-05-07T20:04:36.6206537Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:36.6207159Z 2025-05-07T20:04:36.6208739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6211315Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6212599Z ^ 2025-05-07T20:04:36.6212951Z 2025-05-07T20:04:36.6214559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6217095Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:04:36.6218173Z ^ 2025-05-07T20:04:36.6218405Z 2025-05-07T20:04:36.6218821Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:36.6219426Z 2025-05-07T20:04:36.6221156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6223569Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6224703Z ^ 2025-05-07T20:04:36.6225054Z 2025-05-07T20:04:36.6226648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6229362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6230392Z ^ 2025-05-07T20:04:36.6230619Z 2025-05-07T20:04:36.6231148Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:36.6231752Z 2025-05-07T20:04:36.6233280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:36.6235876Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:36.6237038Z ^ 2025-05-07T20:04:36.6237387Z 2025-05-07T20:04:36.6565912Z [451/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 
2025-05-07T20:04:36.6585730Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:36.6864183Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:04:36.6883983Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:36.8322063Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:04:36.8343108Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:37.2341464Z [454/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:04:37.2362177Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:39.4171689Z [455/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:39.4191352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4193599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4194561Z ^ 2025-05-07T20:04:39.4194817Z 2025-05-07T20:04:39.4195229Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.4195788Z 2025-05-07T20:04:39.4197109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4199343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4200320Z ^ 2025-05-07T20:04:39.4200656Z 2025-05-07T20:04:39.4201963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4204484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() 
= default; 2025-05-07T20:04:39.4205479Z ^ 2025-05-07T20:04:39.4205767Z 2025-05-07T20:04:39.4206132Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.4206706Z 2025-05-07T20:04:39.4208104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4210353Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4211313Z ^ 2025-05-07T20:04:39.4211620Z 2025-05-07T20:04:39.4212997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4215217Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4216249Z ^ 2025-05-07T20:04:39.4216466Z 2025-05-07T20:04:39.4216815Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.4217395Z 2025-05-07T20:04:39.4219002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4221343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4222334Z ^ 2025-05-07T20:04:39.4222677Z 2025-05-07T20:04:39.4224114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted 
on its first declaration 2025-05-07T20:04:39.4226273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4227336Z ^ 2025-05-07T20:04:39.4227554Z 2025-05-07T20:04:39.4227922Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.4228462Z 2025-05-07T20:04:39.4229856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4232110Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4233079Z ^ 2025-05-07T20:04:39.4233380Z 2025-05-07T20:04:39.4234697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4236837Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4237816Z ^ 2025-05-07T20:04:39.4238023Z 2025-05-07T20:04:39.4238377Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.4239022Z 2025-05-07T20:04:39.4240342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.4242888Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.4243923Z ^ 2025-05-07T20:04:39.4244222Z 2025-05-07T20:04:39.9093130Z [456/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:04:39.9115284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9117691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9118952Z ^ 2025-05-07T20:04:39.9119192Z 2025-05-07T20:04:39.9119601Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9120225Z 2025-05-07T20:04:39.9121773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9124145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9125483Z ^ 2025-05-07T20:04:39.9125824Z 2025-05-07T20:04:39.9127329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9129715Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9130817Z ^ 2025-05-07T20:04:39.9131038Z 2025-05-07T20:04:39.9131449Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9132039Z 2025-05-07T20:04:39.9133594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9136079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9137156Z ^ 2025-05-07T20:04:39.9137488Z 2025-05-07T20:04:39.9139026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9141636Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9142665Z ^ 2025-05-07T20:04:39.9142914Z 2025-05-07T20:04:39.9143477Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9144093Z 2025-05-07T20:04:39.9145665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9148107Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9149225Z ^ 2025-05-07T20:04:39.9149566Z 2025-05-07T20:04:39.9151084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9153676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9154911Z ^ 2025-05-07T20:04:39.9155146Z 2025-05-07T20:04:39.9155553Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9156208Z 2025-05-07T20:04:39.9157790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9160349Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9161488Z ^ 2025-05-07T20:04:39.9161841Z 2025-05-07T20:04:39.9163390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9165853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:39.9166947Z ^ 2025-05-07T20:04:39.9167303Z 2025-05-07T20:04:39.9167754Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:39.9168521Z 2025-05-07T20:04:39.9170046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:39.9172544Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:04:39.9173676Z ^ 2025-05-07T20:04:39.9174016Z 2025-05-07T20:04:40.4592717Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:04:40.4607126Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:41.6498800Z [458/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD 
-MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:04:41.6518218Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.1509295Z [459/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:04:42.1527087Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:42.2049133Z [460/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:04:42.2072332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2075382Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2076749Z ^ 2025-05-07T20:04:42.2076994Z 2025-05-07T20:04:42.2077424Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.2078192Z 2025-05-07T20:04:42.2079652Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2082187Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2083354Z ^ 2025-05-07T20:04:42.2083720Z 2025-05-07T20:04:42.2085259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2087866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2089002Z ^ 2025-05-07T20:04:42.2089252Z 2025-05-07T20:04:42.2089691Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.2090340Z 2025-05-07T20:04:42.2091961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2094607Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2096003Z ^ 2025-05-07T20:04:42.2096370Z 2025-05-07T20:04:42.2098022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2100724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2101869Z ^ 2025-05-07T20:04:42.2102130Z 2025-05-07T20:04:42.2102579Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.2103235Z 2025-05-07T20:04:42.2104908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2107557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2108670Z ^ 2025-05-07T20:04:42.2109006Z 2025-05-07T20:04:42.2110451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2112882Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2117462Z ^ 2025-05-07T20:04:42.2117852Z 2025-05-07T20:04:42.2118274Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.2118916Z 2025-05-07T20:04:42.2120459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2123195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2124367Z ^ 
2025-05-07T20:04:42.2124736Z 2025-05-07T20:04:42.2126428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2128946Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2130103Z ^ 2025-05-07T20:04:42.2130338Z 2025-05-07T20:04:42.2130786Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:42.2131442Z 2025-05-07T20:04:42.2133088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:42.2135618Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:42.2136812Z ^ 2025-05-07T20:04:42.2137147Z 2025-05-07T20:04:42.2977903Z [461/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so && : 2025-05-07T20:04:42.5766021Z [462/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:42.5786362Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' 
[-Wunused-command-line-argument] 2025-05-07T20:04:43.1944371Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 
2025-05-07T20:04:43.1961455Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:44.9939673Z [464/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:04:44.9964278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.9966917Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.9968086Z ^ 2025-05-07T20:04:44.9968349Z 2025-05-07T20:04:44.9968807Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.9969460Z 2025-05-07T20:04:44.9971102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:04:44.9973895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.9975119Z ^ 2025-05-07T20:04:44.9975462Z 2025-05-07T20:04:44.9983623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.9986557Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.9987730Z ^ 2025-05-07T20:04:44.9987972Z 2025-05-07T20:04:44.9988501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:44.9989191Z 2025-05-07T20:04:44.9990888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.9993581Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:44.9994787Z ^ 2025-05-07T20:04:44.9995156Z 2025-05-07T20:04:44.9996771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:44.9999482Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0000653Z ^ 2025-05-07T20:04:45.0000913Z 2025-05-07T20:04:45.0001544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0002220Z 2025-05-07T20:04:45.0003879Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0006662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0007846Z ^ 2025-05-07T20:04:45.0008199Z 2025-05-07T20:04:45.0009871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0012459Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0013563Z ^ 2025-05-07T20:04:45.0013835Z 2025-05-07T20:04:45.0014280Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0014932Z 2025-05-07T20:04:45.0016549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0019119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0020471Z ^ 2025-05-07T20:04:45.0020816Z 2025-05-07T20:04:45.0022478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0025232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0026375Z ^ 
2025-05-07T20:04:45.0026622Z 2025-05-07T20:04:45.0027047Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:45.0027707Z 2025-05-07T20:04:45.0029381Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:45.0032103Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:45.0033288Z ^ 2025-05-07T20:04:45.0033668Z 2025-05-07T20:04:45.4647409Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:04:45.4667641Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:45.9366389Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:04:45.9386856Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.0458094Z [467/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:04:47.0476264Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.0723361Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:04:47.0740871Z clang-16: warning: argument unused during compilation: 
'-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.2950379Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:04:47.2968194Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:47.6365416Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA 
-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:04:47.6383854Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:48.1020097Z [471/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:04:48.1037172Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:48.8539174Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:04:48.8556974Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:50.7238144Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:04:50.7255741Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:50.8090856Z 
[474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:04:50.8108551Z 
clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:51.0359808Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:04:51.0377197Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:51.2097128Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG 
-std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:04:51.2116140Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:52.4018794Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:04:52.4035001Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:53.0357362Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:04:53.0374759Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:53.0969838Z [479/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:04:53.0986404Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:53.1909998Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:04:53.1927367Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:54.8426815Z [481/608] 
/github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations 
--expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:04:54.8444570Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8446485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8447286Z ^ 2025-05-07T20:04:54.8447491Z 2025-05-07T20:04:54.8447805Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8448276Z 2025-05-07T20:04:54.8449629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8451479Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8452313Z ^ 2025-05-07T20:04:54.8452599Z 2025-05-07T20:04:54.8453910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8455741Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8456680Z ^ 2025-05-07T20:04:54.8456877Z 2025-05-07T20:04:54.8457199Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8457700Z 2025-05-07T20:04:54.8458934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8460997Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8461933Z ^ 2025-05-07T20:04:54.8462218Z 2025-05-07T20:04:54.8463432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8465435Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8466328Z ^ 2025-05-07T20:04:54.8466561Z 2025-05-07T20:04:54.8466914Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8467412Z 2025-05-07T20:04:54.8468663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8470774Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8471702Z ^ 2025-05-07T20:04:54.8471990Z 2025-05-07T20:04:54.8473218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8475204Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8476426Z ^ 2025-05-07T20:04:54.8476626Z 2025-05-07T20:04:54.8476963Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8477472Z 2025-05-07T20:04:54.8478712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8480720Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8481607Z ^ 2025-05-07T20:04:54.8481914Z 2025-05-07T20:04:54.8483125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8485269Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:04:54.8486160Z ^ 2025-05-07T20:04:54.8486375Z 2025-05-07T20:04:54.8486703Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:04:54.8487182Z 2025-05-07T20:04:54.8488471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:04:54.8490316Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:04:54.8491240Z ^ 2025-05-07T20:04:54.8491493Z 2025-05-07T20:04:55.1634698Z [482/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 
2025-05-07T20:04:55.1651770Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:58.5321877Z [483/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:04:58.5337159Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:58.8736604Z [484/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ 
-I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:04:58.8754875Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:04:59.7237687Z [485/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include 
-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:04:59.7255920Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:00.1980750Z [486/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:05:00.1998390Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:00.6438453Z [487/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:05:00.6456464Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:00.7710622Z [488/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:05:00.7729265Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:01.4536254Z [489/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:05:01.4552858Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:01.7125391Z [490/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:05:01.7148602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7151312Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7152488Z ^ 2025-05-07T20:05:01.7152775Z 2025-05-07T20:05:01.7153194Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.7153786Z 2025-05-07T20:05:01.7155337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7158089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7159221Z ^ 2025-05-07T20:05:01.7159606Z 2025-05-07T20:05:01.7161205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7163747Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:05:01.7164955Z ^ 2025-05-07T20:05:01.7165395Z 2025-05-07T20:05:01.7165835Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.7166493Z 2025-05-07T20:05:01.7168108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7170626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7171792Z ^ 2025-05-07T20:05:01.7172142Z 2025-05-07T20:05:01.7173519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7175895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7177255Z ^ 2025-05-07T20:05:01.7177497Z 2025-05-07T20:05:01.7177964Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.7178543Z 2025-05-07T20:05:01.7180046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7182724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7183763Z ^ 2025-05-07T20:05:01.7184321Z 2025-05-07T20:05:01.7185746Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:05:01.7188096Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7189174Z ^ 2025-05-07T20:05:01.7189434Z 2025-05-07T20:05:01.7190064Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.7190696Z 2025-05-07T20:05:01.7192131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7194915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7196051Z ^ 2025-05-07T20:05:01.7196394Z 2025-05-07T20:05:01.7197930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7200458Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7201633Z ^ 2025-05-07T20:05:01.7201881Z 2025-05-07T20:05:01.7202363Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:01.7203027Z 2025-05-07T20:05:01.7204705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:01.7207318Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:01.7208684Z ^ 2025-05-07T20:05:01.7209037Z 2025-05-07T20:05:01.9182659Z [491/608] 
/github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:05:01.9200135Z clang-16: warning: 
argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:02.5632412Z [492/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:02.5651728Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:03.2384250Z [493/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib 
-fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:05:03.2403662Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:04.1082922Z [494/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:05:04.1100844Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:04.1622451Z [495/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:05:04.1641875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1643965Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1645040Z ^ 2025-05-07T20:05:04.1645243Z 2025-05-07T20:05:04.1645605Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.1646218Z 2025-05-07T20:05:04.1647616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1649796Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1650792Z ^ 2025-05-07T20:05:04.1651090Z 2025-05-07T20:05:04.1652391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1654419Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1655344Z ^ 2025-05-07T20:05:04.1655587Z 2025-05-07T20:05:04.1655989Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.1656566Z 2025-05-07T20:05:04.1658229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1660728Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:05:04.1661750Z ^ 2025-05-07T20:05:04.1662058Z 2025-05-07T20:05:04.1663330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1665480Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1666550Z ^ 2025-05-07T20:05:04.1666782Z 2025-05-07T20:05:04.1667177Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.1669441Z 2025-05-07T20:05:04.1670910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1673060Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1673980Z ^ 2025-05-07T20:05:04.1674273Z 2025-05-07T20:05:04.1675677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1678429Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1679405Z ^ 2025-05-07T20:05:04.1679631Z 2025-05-07T20:05:04.1680044Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.1680548Z 2025-05-07T20:05:04.1681882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:05:04.1683901Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1685047Z ^ 2025-05-07T20:05:04.1685350Z 2025-05-07T20:05:04.1686719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1688938Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1689823Z ^ 2025-05-07T20:05:04.1690068Z 2025-05-07T20:05:04.1690434Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:04.1690985Z 2025-05-07T20:05:04.1692285Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:04.1694352Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:04.1695416Z ^ 2025-05-07T20:05:04.1695732Z 2025-05-07T20:05:04.4616300Z [496/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:05:04.4630354Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:04.6223754Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:05:04.6248790Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:04.9190634Z [498/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC 
-DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 
2025-05-07T20:05:04.9212253Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:10.8504976Z [499/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:05:10.8523016Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:14.0022274Z [500/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:05:14.0044713Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0047247Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0048425Z ^ 2025-05-07T20:05:14.0048689Z 2025-05-07T20:05:14.0049118Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0049784Z 2025-05-07T20:05:14.0051343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0053878Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0055323Z ^ 2025-05-07T20:05:14.0055716Z 2025-05-07T20:05:14.0057265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0060089Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0061510Z ^ 2025-05-07T20:05:14.0061776Z 2025-05-07T20:05:14.0062227Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0062876Z 2025-05-07T20:05:14.0064466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0067232Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0068370Z ^ 2025-05-07T20:05:14.0068745Z 2025-05-07T20:05:14.0070349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0072989Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0074131Z ^ 2025-05-07T20:05:14.0074390Z 2025-05-07T20:05:14.0074850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0075521Z 2025-05-07T20:05:14.0077466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0080172Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0081595Z ^ 2025-05-07T20:05:14.0081976Z 2025-05-07T20:05:14.0083665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0086263Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0087406Z ^ 2025-05-07T20:05:14.0087658Z 2025-05-07T20:05:14.0088115Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0088773Z 2025-05-07T20:05:14.0090376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0092959Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0094100Z ^ 2025-05-07T20:05:14.0094480Z 2025-05-07T20:05:14.0096063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0098652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0099765Z ^ 2025-05-07T20:05:14.0100313Z 2025-05-07T20:05:14.0100899Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:05:14.0101516Z 2025-05-07T20:05:14.0103077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0105889Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0107072Z ^ 2025-05-07T20:05:14.0107434Z 2025-05-07T20:05:14.0665632Z [501/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode 
arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:05:14.0689624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0692266Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0693563Z ^ 2025-05-07T20:05:14.0693867Z 2025-05-07T20:05:14.0694294Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0694943Z 2025-05-07T20:05:14.0696857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0699444Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0700782Z ^ 2025-05-07T20:05:14.0701127Z 2025-05-07T20:05:14.0702844Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0705403Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0706674Z ^ 2025-05-07T20:05:14.0706924Z 2025-05-07T20:05:14.0707388Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0708061Z 2025-05-07T20:05:14.0709651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0712596Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0713798Z ^ 2025-05-07T20:05:14.0714196Z 2025-05-07T20:05:14.0715794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0718390Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0719709Z ^ 2025-05-07T20:05:14.0719970Z 2025-05-07T20:05:14.0720422Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:05:14.0721089Z 2025-05-07T20:05:14.0722727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0725189Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0726337Z ^ 2025-05-07T20:05:14.0726687Z 2025-05-07T20:05:14.0728239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0730830Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0731980Z ^ 2025-05-07T20:05:14.0732226Z 2025-05-07T20:05:14.0732629Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0733268Z 2025-05-07T20:05:14.0734914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0737478Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0738641Z ^ 2025-05-07T20:05:14.0739016Z 2025-05-07T20:05:14.0740945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0743696Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0744906Z ^ 2025-05-07T20:05:14.0745165Z 2025-05-07T20:05:14.0745642Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.0746233Z 2025-05-07T20:05:14.0747929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.0750631Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.0751814Z ^ 2025-05-07T20:05:14.0752241Z 2025-05-07T20:05:14.4427674Z [502/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:14.4453014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4456010Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4457227Z ^ 2025-05-07T20:05:14.4457480Z 
2025-05-07T20:05:14.4457933Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.4458628Z 2025-05-07T20:05:14.4460181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4462967Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4464144Z ^ 2025-05-07T20:05:14.4464520Z 2025-05-07T20:05:14.4465971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4468644Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4469815Z ^ 2025-05-07T20:05:14.4470072Z 2025-05-07T20:05:14.4470532Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.4471191Z 2025-05-07T20:05:14.4472798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4475387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4476809Z ^ 2025-05-07T20:05:14.4477156Z 2025-05-07T20:05:14.4478528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4481280Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4482411Z ^ 2025-05-07T20:05:14.4482660Z 2025-05-07T20:05:14.4483065Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.4483707Z 2025-05-07T20:05:14.4485282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4487886Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4488928Z ^ 2025-05-07T20:05:14.4489266Z 2025-05-07T20:05:14.4490804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4493371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4494531Z ^ 2025-05-07T20:05:14.4494779Z 2025-05-07T20:05:14.4495169Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.4495728Z 2025-05-07T20:05:14.4497343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4499675Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4500758Z ^ 2025-05-07T20:05:14.4501101Z 2025-05-07T20:05:14.4502505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4505019Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4506063Z ^ 2025-05-07T20:05:14.4506412Z 2025-05-07T20:05:14.4506804Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:14.4507462Z 2025-05-07T20:05:14.4509093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:14.4511626Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:14.4512759Z ^ 2025-05-07T20:05:14.4513108Z 2025-05-07T20:05:15.6252365Z [503/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:15.6277234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6279500Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6280644Z ^ 2025-05-07T20:05:15.6280908Z 2025-05-07T20:05:15.6281492Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:15.6282187Z 2025-05-07T20:05:15.6283783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6286203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6287271Z ^ 2025-05-07T20:05:15.6287615Z 2025-05-07T20:05:15.6288901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6290535Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6291169Z ^ 2025-05-07T20:05:15.6291320Z 2025-05-07T20:05:15.6291589Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:15.6291950Z 2025-05-07T20:05:15.6292813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6294235Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6294892Z ^ 2025-05-07T20:05:15.6295233Z 2025-05-07T20:05:15.6295958Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:15.6296927Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:15.6297250Z ^ 2025-05-07T20:05:15.6297430Z 2025-05-07T20:05:15.6298287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6299695Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6300318Z ^ 2025-05-07T20:05:15.6300636Z 2025-05-07T20:05:15.6300882Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:15.6301244Z 2025-05-07T20:05:15.6302135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6303528Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6304186Z ^ 2025-05-07T20:05:15.6304393Z 2025-05-07T20:05:15.6305193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:15.6306136Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:15.6306469Z ^ 2025-05-07T20:05:15.6306622Z 2025-05-07T20:05:15.6307471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly 
defaulted on its first declaration 2025-05-07T20:05:15.6308919Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6309568Z ^ 2025-05-07T20:05:15.6309716Z 2025-05-07T20:05:15.6309960Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:15.6310381Z 2025-05-07T20:05:15.6311253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6312672Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6313316Z ^ 2025-05-07T20:05:15.6313526Z 2025-05-07T20:05:15.6314272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:15.6315212Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:15.6315546Z ^ 2025-05-07T20:05:15.6315694Z 2025-05-07T20:05:15.6316569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6317991Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6318635Z ^ 2025-05-07T20:05:15.6318790Z 2025-05-07T20:05:15.6319038Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:15.6319465Z 2025-05-07T20:05:15.6320331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:15.6321745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:15.6322385Z ^ 2025-05-07T20:05:15.6322616Z 2025-05-07T20:05:15.6323347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:15.6324319Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:15.6324639Z ^ 2025-05-07T20:05:15.6324795Z 2025-05-07T20:05:16.1777827Z [504/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:05:16.1795145Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:20.0247705Z [505/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:05:20.0267827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:05:20.0269835Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0270731Z ^ 2025-05-07T20:05:20.0272286Z 2025-05-07T20:05:20.0272699Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.0273222Z 2025-05-07T20:05:20.0274469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0276878Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0277789Z ^ 2025-05-07T20:05:20.0278103Z 2025-05-07T20:05:20.0279292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0281287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0282150Z ^ 2025-05-07T20:05:20.0282379Z 2025-05-07T20:05:20.0282720Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.0283208Z 2025-05-07T20:05:20.0284505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0286413Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0287529Z ^ 2025-05-07T20:05:20.0287830Z 2025-05-07T20:05:20.0289074Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0291044Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0291932Z ^ 2025-05-07T20:05:20.0292143Z 2025-05-07T20:05:20.0292487Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.0292997Z 2025-05-07T20:05:20.0294186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0296195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0297089Z ^ 2025-05-07T20:05:20.0297405Z 2025-05-07T20:05:20.0298615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0300708Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0301583Z ^ 2025-05-07T20:05:20.0301815Z 2025-05-07T20:05:20.0302293Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.0302787Z 2025-05-07T20:05:20.0304020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0305948Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0306926Z ^ 2025-05-07T20:05:20.0307233Z 2025-05-07T20:05:20.0308461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0310578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0311473Z ^ 2025-05-07T20:05:20.0311681Z 2025-05-07T20:05:20.0312005Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:20.0312528Z 2025-05-07T20:05:20.0313734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:20.0315709Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:20.0316642Z ^ 2025-05-07T20:05:20.0316994Z 2025-05-07T20:05:21.8577715Z [506/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:05:21.8596990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8599007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8600026Z ^ 2025-05-07T20:05:21.8600242Z 2025-05-07T20:05:21.8600594Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.8601094Z 2025-05-07T20:05:21.8602370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8604376Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8605281Z ^ 2025-05-07T20:05:21.8605572Z 2025-05-07T20:05:21.8606813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8608797Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8609692Z ^ 2025-05-07T20:05:21.8610003Z 2025-05-07T20:05:21.8610348Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.8610992Z 2025-05-07T20:05:21.8612165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8614173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = 
default; 2025-05-07T20:05:21.8615051Z ^ 2025-05-07T20:05:21.8615329Z 2025-05-07T20:05:21.8616542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8618512Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8619360Z ^ 2025-05-07T20:05:21.8619562Z 2025-05-07T20:05:21.8619883Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.8620530Z 2025-05-07T20:05:21.8621725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8623611Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8624523Z ^ 2025-05-07T20:05:21.8624802Z 2025-05-07T20:05:21.8626131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8628079Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8628990Z ^ 2025-05-07T20:05:21.8629188Z 2025-05-07T20:05:21.8629528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.8630033Z 2025-05-07T20:05:21.8631292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:05:21.8633181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8634134Z ^ 2025-05-07T20:05:21.8634429Z 2025-05-07T20:05:21.8635662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8638011Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8639120Z ^ 2025-05-07T20:05:21.8639386Z 2025-05-07T20:05:21.8639938Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:21.8640575Z 2025-05-07T20:05:21.8642856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:21.8646598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:21.8647841Z ^ 2025-05-07T20:05:21.8648316Z 2025-05-07T20:05:26.0176277Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:05:26.0198253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0201007Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0202120Z ^ 2025-05-07T20:05:26.0202357Z 2025-05-07T20:05:26.0202756Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:26.0203409Z 2025-05-07T20:05:26.0204937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0207340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0208456Z ^ 2025-05-07T20:05:26.0208797Z 2025-05-07T20:05:26.0210308Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0212743Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0213957Z ^ 2025-05-07T20:05:26.0214188Z 2025-05-07T20:05:26.0214602Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:26.0215187Z 2025-05-07T20:05:26.0216610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0219014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0220125Z ^ 2025-05-07T20:05:26.0220613Z 2025-05-07T20:05:26.0222135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0224633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0225659Z ^ 2025-05-07T20:05:26.0225927Z 2025-05-07T20:05:26.0226349Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:26.0226903Z 2025-05-07T20:05:26.0228389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0230678Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0232111Z ^ 2025-05-07T20:05:26.0232447Z 2025-05-07T20:05:26.0233757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0236373Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0237580Z ^ 2025-05-07T20:05:26.0237829Z 2025-05-07T20:05:26.0238243Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:05:26.0238814Z 2025-05-07T20:05:26.0240297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0242401Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0243386Z ^ 2025-05-07T20:05:26.0243724Z 2025-05-07T20:05:26.0245193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0247509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0248437Z ^ 2025-05-07T20:05:26.0248686Z 2025-05-07T20:05:26.0249103Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:26.0249717Z 2025-05-07T20:05:26.0251174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:26.0253537Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:26.0254808Z ^ 2025-05-07T20:05:26.0255148Z 2025-05-07T20:05:31.5732107Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:05:31.5752833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5755058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5756061Z ^ 2025-05-07T20:05:31.5756290Z 2025-05-07T20:05:31.5756707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:31.5757254Z 2025-05-07T20:05:31.5758659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5761032Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5762256Z ^ 2025-05-07T20:05:31.5762605Z 2025-05-07T20:05:31.5764193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5767023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5768097Z ^ 2025-05-07T20:05:31.5768342Z 2025-05-07T20:05:31.5768741Z Remark: The warnings can be suppressed with 
"-diag-suppress " 2025-05-07T20:05:31.5769298Z 2025-05-07T20:05:31.5770733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5773211Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5774559Z ^ 2025-05-07T20:05:31.5774911Z 2025-05-07T20:05:31.5776468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:31.5778096Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:31.5778628Z ^ 2025-05-07T20:05:31.5778875Z 2025-05-07T20:05:31.5780547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5783545Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5784713Z ^ 2025-05-07T20:05:31.5784977Z 2025-05-07T20:05:31.5785417Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:31.5786069Z 2025-05-07T20:05:31.5787702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5790340Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5791547Z ^ 2025-05-07T20:05:31.5791994Z 
2025-05-07T20:05:31.5793364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:31.5795069Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:31.5795606Z ^ 2025-05-07T20:05:31.5795852Z 2025-05-07T20:05:31.5797482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5800065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5801313Z ^ 2025-05-07T20:05:31.5801545Z 2025-05-07T20:05:31.5802136Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:31.5802799Z 2025-05-07T20:05:31.5804447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5807056Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5808169Z ^ 2025-05-07T20:05:31.5808637Z 2025-05-07T20:05:31.5809840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:31.5811320Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:31.5811795Z ^ 2025-05-07T20:05:31.5812016Z 2025-05-07T20:05:31.5813533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5815916Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5816948Z ^ 2025-05-07T20:05:31.5817181Z 2025-05-07T20:05:31.5817564Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:31.5818177Z 2025-05-07T20:05:31.5819573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:31.5821840Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:31.5822731Z ^ 2025-05-07T20:05:31.5823027Z 2025-05-07T20:05:31.5824215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:05:31.5825607Z const auto offset_idx = idx * D_emb; 2025-05-07T20:05:31.5826025Z ^ 2025-05-07T20:05:31.5826259Z 2025-05-07T20:05:32.4807527Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:05:32.4828659Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4831065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4831971Z ^ 2025-05-07T20:05:32.4832150Z 2025-05-07T20:05:32.4832591Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:32.4833153Z 2025-05-07T20:05:32.4834468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4836814Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4837884Z ^ 2025-05-07T20:05:32.4838231Z 2025-05-07T20:05:32.4840014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4842431Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4843510Z ^ 2025-05-07T20:05:32.4843765Z 2025-05-07T20:05:32.4844191Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:32.4844808Z 2025-05-07T20:05:32.4846545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4849154Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4850266Z ^ 2025-05-07T20:05:32.4850625Z 2025-05-07T20:05:32.4852080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4854550Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4855602Z ^ 2025-05-07T20:05:32.4855845Z 2025-05-07T20:05:32.4856235Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:32.4856811Z 2025-05-07T20:05:32.4858214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4860768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4861800Z ^ 2025-05-07T20:05:32.4862103Z 2025-05-07T20:05:32.4863521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4866064Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4867181Z ^ 2025-05-07T20:05:32.4867420Z 2025-05-07T20:05:32.4867844Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:32.4868471Z 2025-05-07T20:05:32.4870029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4872533Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4873623Z ^ 2025-05-07T20:05:32.4873990Z 2025-05-07T20:05:32.4875415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4878123Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4878985Z ^ 2025-05-07T20:05:32.4879227Z 2025-05-07T20:05:32.4879599Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:32.4880140Z 2025-05-07T20:05:32.4881730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:32.4884662Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:32.4885785Z ^ 2025-05-07T20:05:32.4886135Z 2025-05-07T20:05:32.8501621Z [510/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:05:33.8101382Z [511/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:05:33.8123953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8126653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8127704Z ^ 2025-05-07T20:05:33.8128008Z 2025-05-07T20:05:33.8128450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:33.8129090Z 2025-05-07T20:05:33.8130589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8133143Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8134286Z ^ 2025-05-07T20:05:33.8134646Z 2025-05-07T20:05:33.8136146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8138587Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8139680Z ^ 2025-05-07T20:05:33.8140121Z 2025-05-07T20:05:33.8140676Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:33.8141281Z 2025-05-07T20:05:33.8142894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8145591Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8146915Z ^ 
2025-05-07T20:05:33.8147264Z 2025-05-07T20:05:33.8148802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8151485Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8152553Z ^ 2025-05-07T20:05:33.8152806Z 2025-05-07T20:05:33.8153231Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:33.8153899Z 2025-05-07T20:05:33.8155403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8157951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8159077Z ^ 2025-05-07T20:05:33.8159431Z 2025-05-07T20:05:33.8161003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8163586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8164897Z ^ 2025-05-07T20:05:33.8165147Z 2025-05-07T20:05:33.8165578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:33.8166216Z 2025-05-07T20:05:33.8167821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:05:33.8170368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8171459Z ^ 2025-05-07T20:05:33.8171812Z 2025-05-07T20:05:33.8173417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8175821Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8177195Z ^ 2025-05-07T20:05:33.8177434Z 2025-05-07T20:05:33.8177861Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:33.8178527Z 2025-05-07T20:05:33.8180041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:33.8183072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:33.8184247Z ^ 2025-05-07T20:05:33.8184652Z 2025-05-07T20:05:34.3234201Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:05:34.3255124Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3257922Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3259435Z ^ 2025-05-07T20:05:34.3259714Z 2025-05-07T20:05:34.3260122Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:34.3260898Z 2025-05-07T20:05:34.3262433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3264719Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3265803Z ^ 2025-05-07T20:05:34.3266149Z 2025-05-07T20:05:34.3267828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3270257Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3271325Z ^ 2025-05-07T20:05:34.3271562Z 2025-05-07T20:05:34.3271912Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:34.3272487Z 2025-05-07T20:05:34.3274115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3276865Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3278114Z ^ 2025-05-07T20:05:34.3278468Z 2025-05-07T20:05:34.3279799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3281884Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3282797Z ^ 2025-05-07T20:05:34.3283049Z 2025-05-07T20:05:34.3283450Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:34.3284060Z 2025-05-07T20:05:34.3285535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3287847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3288924Z ^ 2025-05-07T20:05:34.3289245Z 2025-05-07T20:05:34.3290765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3293307Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3294399Z ^ 2025-05-07T20:05:34.3294628Z 2025-05-07T20:05:34.3295143Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:34.3295719Z 2025-05-07T20:05:34.3297131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3299578Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3300840Z ^ 2025-05-07T20:05:34.3301215Z 2025-05-07T20:05:34.3302813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3305350Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3306420Z ^ 2025-05-07T20:05:34.3306645Z 2025-05-07T20:05:34.3307077Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:34.3307706Z 2025-05-07T20:05:34.3309480Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:34.3312119Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:34.3313320Z ^ 2025-05-07T20:05:34.3313695Z 2025-05-07T20:05:39.1828568Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:05:39.1848015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1850289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1851282Z ^ 2025-05-07T20:05:39.1851495Z 2025-05-07T20:05:39.1851862Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:39.1852413Z 2025-05-07T20:05:39.1853771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1856023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1857049Z ^ 2025-05-07T20:05:39.1857378Z 2025-05-07T20:05:39.1858985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1861817Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1862787Z ^ 2025-05-07T20:05:39.1863003Z 2025-05-07T20:05:39.1863468Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:39.1864012Z 2025-05-07T20:05:39.1865490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is 
explicitly defaulted on its first declaration 2025-05-07T20:05:39.1867766Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1868819Z ^ 2025-05-07T20:05:39.1869143Z 2025-05-07T20:05:39.1870580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1873052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1874015Z ^ 2025-05-07T20:05:39.1874231Z 2025-05-07T20:05:39.1874603Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:39.1875178Z 2025-05-07T20:05:39.1876842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1879145Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1880175Z ^ 2025-05-07T20:05:39.1880716Z 2025-05-07T20:05:39.1882040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1884219Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1885121Z ^ 2025-05-07T20:05:39.1885321Z 2025-05-07T20:05:39.1885707Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:39.1886250Z 2025-05-07T20:05:39.1887802Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1890197Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1891297Z ^ 2025-05-07T20:05:39.1891627Z 2025-05-07T20:05:39.1893139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1895434Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1896350Z ^ 2025-05-07T20:05:39.1896563Z 2025-05-07T20:05:39.1897130Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:39.1897700Z 2025-05-07T20:05:39.1899062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:39.1901319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:39.1902439Z ^ 2025-05-07T20:05:39.1902741Z 2025-05-07T20:05:41.9851511Z [514/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++20 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:05:41.9869668Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:05:43.7707836Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED 
-DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:05:44.4146009Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:05:44.4167090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4169903Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4170988Z ^ 2025-05-07T20:05:44.4171247Z 2025-05-07T20:05:44.4171663Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:44.4172252Z 2025-05-07T20:05:44.4174107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4176768Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4177997Z ^ 2025-05-07T20:05:44.4178324Z 2025-05-07T20:05:44.4179903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4182335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4183401Z ^ 2025-05-07T20:05:44.4183630Z 2025-05-07T20:05:44.4184043Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:44.4184616Z 2025-05-07T20:05:44.4186128Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4188457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4189503Z ^ 2025-05-07T20:05:44.4189839Z 2025-05-07T20:05:44.4191223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4193898Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4194966Z ^ 2025-05-07T20:05:44.4195228Z 2025-05-07T20:05:44.4195640Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:44.4196275Z 2025-05-07T20:05:44.4197810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4200317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4201361Z ^ 2025-05-07T20:05:44.4201682Z 2025-05-07T20:05:44.4203135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4205490Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4206556Z ^ 2025-05-07T20:05:44.4206799Z 2025-05-07T20:05:44.4207197Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:44.4207819Z 2025-05-07T20:05:44.4209526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4212004Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4213224Z ^ 2025-05-07T20:05:44.4213551Z 2025-05-07T20:05:44.4215167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4217588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4218732Z ^ 2025-05-07T20:05:44.4218988Z 2025-05-07T20:05:44.4219355Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:05:44.4219946Z 2025-05-07T20:05:44.4221797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:44.4224258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:44.4225371Z ^ 2025-05-07T20:05:44.4225697Z 2025-05-07T20:05:47.7165484Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 
-gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:05:53.2326104Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:05:53.5309596Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:05:53.5330793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5333615Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5334671Z ^ 2025-05-07T20:05:53.5334914Z 2025-05-07T20:05:53.5335335Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5335922Z 2025-05-07T20:05:53.5337363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5339993Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5341350Z ^ 2025-05-07T20:05:53.5341678Z 2025-05-07T20:05:53.5343198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5345716Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5346973Z ^ 2025-05-07T20:05:53.5347261Z 2025-05-07T20:05:53.5347696Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5348282Z 2025-05-07T20:05:53.5349783Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5352330Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5353468Z ^ 2025-05-07T20:05:53.5353825Z 2025-05-07T20:05:53.5355424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5358002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5359176Z ^ 2025-05-07T20:05:53.5359425Z 2025-05-07T20:05:53.5359874Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5360486Z 2025-05-07T20:05:53.5362039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5364833Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5365994Z ^ 2025-05-07T20:05:53.5366368Z 2025-05-07T20:05:53.5367996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5370643Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5371719Z ^ 2025-05-07T20:05:53.5371969Z 2025-05-07T20:05:53.5372333Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5373103Z 2025-05-07T20:05:53.5374629Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5377067Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5378053Z ^ 2025-05-07T20:05:53.5378397Z 2025-05-07T20:05:53.5379858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5382442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5383473Z ^ 2025-05-07T20:05:53.5383688Z 2025-05-07T20:05:53.5384057Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:05:53.5384689Z 2025-05-07T20:05:53.5386260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:05:53.5388704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:05:53.5390097Z ^ 2025-05-07T20:05:53.5390466Z 2025-05-07T20:06:04.1235774Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 
2025-05-07T20:06:08.9496472Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion 
-Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:06:08.9517728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9520031Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9521064Z ^ 2025-05-07T20:06:08.9521289Z 2025-05-07T20:06:08.9521702Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.9522297Z 2025-05-07T20:06:08.9524082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9531194Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9532236Z ^ 2025-05-07T20:06:08.9532593Z 2025-05-07T20:06:08.9534162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9536935Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9537931Z ^ 2025-05-07T20:06:08.9538175Z 2025-05-07T20:06:08.9538578Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.9539152Z 2025-05-07T20:06:08.9540751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9543185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9544278Z ^ 2025-05-07T20:06:08.9544619Z 2025-05-07T20:06:08.9546029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9548432Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9549533Z ^ 2025-05-07T20:06:08.9549770Z 2025-05-07T20:06:08.9550342Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.9551127Z 2025-05-07T20:06:08.9552641Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9555033Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9556166Z ^ 2025-05-07T20:06:08.9556509Z 2025-05-07T20:06:08.9558080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9560524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9561615Z ^ 2025-05-07T20:06:08.9561872Z 2025-05-07T20:06:08.9562301Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.9562898Z 2025-05-07T20:06:08.9564481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9566941Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9568303Z ^ 2025-05-07T20:06:08.9568644Z 2025-05-07T20:06:08.9570352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9572799Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:08.9574051Z ^ 2025-05-07T20:06:08.9574305Z 2025-05-07T20:06:08.9574731Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:08.9575448Z 2025-05-07T20:06:08.9577317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:08.9579679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:06:08.9580920Z ^ 2025-05-07T20:06:08.9581269Z 2025-05-07T20:06:10.3935688Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 
-Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:06:10.3957595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.3960260Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.3961738Z ^ 2025-05-07T20:06:10.3962025Z 2025-05-07T20:06:10.3962455Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:10.3963103Z 2025-05-07T20:06:10.3964615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.3967464Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.3968655Z ^ 2025-05-07T20:06:10.3968992Z 2025-05-07T20:06:10.3970405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:06:10.3972730Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.3973797Z ^ 2025-05-07T20:06:10.3974052Z 2025-05-07T20:06:10.3974454Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:10.3975012Z 2025-05-07T20:06:10.3976924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.3979324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.3980515Z ^ 2025-05-07T20:06:10.3980864Z 2025-05-07T20:06:10.3982347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.3984828Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.3986381Z ^ 2025-05-07T20:06:10.3986641Z 2025-05-07T20:06:10.3987020Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:10.3987659Z 2025-05-07T20:06:10.3988967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.3991499Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.3992585Z ^ 2025-05-07T20:06:10.3992973Z 2025-05-07T20:06:10.3994440Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.3996789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.3997775Z ^ 2025-05-07T20:06:10.3998004Z 2025-05-07T20:06:10.3998397Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:10.3998923Z 2025-05-07T20:06:10.4000317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.4002897Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.4003971Z ^ 2025-05-07T20:06:10.4004404Z 2025-05-07T20:06:10.4006067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.4008895Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.4009975Z ^ 2025-05-07T20:06:10.4010201Z 2025-05-07T20:06:10.4010588Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:10.4011164Z 2025-05-07T20:06:10.4012625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:10.4015012Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:10.4016095Z ^ 2025-05-07T20:06:10.4016437Z 2025-05-07T20:06:13.4087489Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl 
--expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:06:14.9514044Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:06:17.1598274Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:06:17.5837011Z [526/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ 
-MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:06:17.9625248Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:06:19.9475354Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:06:20.3655752Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:06:20.3677733Z In file included from 
tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:1: 2025-05-07T20:06:20.3680246Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3691237Z static void __device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:20.3702210Z ^ 2025-05-07T20:06:20.3705066Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3708338Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:46:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3711614Z 
/tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3720872Z static void __device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:06:20.3728559Z ^ 2025-05-07T20:06:20.3731036Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3734154Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:52:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3737576Z /tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3741090Z 
/tmp/tmpxft_00004276_00000000-6_jagged_unique_indices.compute_90a.cudafe1.stub.c:55:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:06:20.3742934Z 8 warnings generated. 2025-05-07T20:06:20.8220292Z [530/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:06:20.9152387Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:06:21.1055083Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:06:22.1166876Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler 
-DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ 
-MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:06:24.0273943Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:06:24.0330632Z [535/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:06:24.0332906Z ################################################################################ 2025-05-07T20:06:24.0333602Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.0334537Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:06:24.0335522Z Removing all RPATHs ... 2025-05-07T20:06:24.0336036Z ################################################################################ 2025-05-07T20:06:24.0451030Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 1 2025-05-07T20:06:24.0453235Z ################################################################################ 2025-05-07T20:06:24.0453860Z [CMAKE] Running post-build script ... 
2025-05-07T20:06:24.0454729Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:06:24.0455607Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:24.0456273Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:24.0456956Z ################################################################################ 2025-05-07T20:06:24.1144275Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:24.1146740Z ################################################################################ 2025-05-07T20:06:24.1147386Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.1148378Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:06:24.1149538Z Removing all RPATHs ... 2025-05-07T20:06:24.1150023Z ################################################################################ 2025-05-07T20:06:24.5567444Z [538/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:06:24.5568771Z ################################################################################ 2025-05-07T20:06:24.5569141Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.5569703Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:06:24.5570313Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:24.5570693Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:24.5571114Z ################################################################################ 2025-05-07T20:06:24.5646220Z [539/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:24.5648472Z ################################################################################ 2025-05-07T20:06:24.5649086Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.5650257Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:06:24.5651230Z Removing all RPATHs ... 2025-05-07T20:06:24.5651712Z ################################################################################ 2025-05-07T20:06:24.6277192Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:24.6279473Z ################################################################################ 2025-05-07T20:06:24.6280084Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.6280978Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:06:24.6281878Z Removing all RPATHs ... 
2025-05-07T20:06:24.6282326Z ################################################################################ 2025-05-07T20:06:24.6353782Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:06:24.6356249Z ################################################################################ 2025-05-07T20:06:24.6356855Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.6357915Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:06:24.6358971Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:24.6359625Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:24.6360332Z ################################################################################ 2025-05-07T20:06:24.6446956Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:06:24.6449309Z ################################################################################ 2025-05-07T20:06:24.6450277Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.6451277Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:06:24.6452316Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:24.6452927Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:24.6453634Z ################################################################################ 2025-05-07T20:06:24.6704801Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:24.6706926Z ################################################################################ 2025-05-07T20:06:24.6707514Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.6708482Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:06:24.6709433Z Removing all RPATHs ... 2025-05-07T20:06:24.6709858Z ################################################################################ 2025-05-07T20:06:24.9809909Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:06:24.9812255Z ################################################################################ 2025-05-07T20:06:24.9812907Z [CMAKE] Running post-build script ... 2025-05-07T20:06:24.9814300Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:06:24.9815417Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:24.9816083Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:24.9817012Z ################################################################################ 2025-05-07T20:06:24.9998590Z [545/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:06:25.0000974Z ################################################################################ 2025-05-07T20:06:25.0001620Z [CMAKE] Running post-build script ... 2025-05-07T20:06:25.0002720Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:06:25.0003844Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:06:25.0004502Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:25.0005242Z ################################################################################ 2025-05-07T20:06:25.0112135Z [546/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:06:25.0114638Z ################################################################################ 2025-05-07T20:06:25.0115226Z [CMAKE] Running post-build script ... 2025-05-07T20:06:25.0116491Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:06:25.0117746Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:25.0118395Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:25.0119169Z ################################################################################ 2025-05-07T20:06:25.0592470Z [547/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:06:25.9562791Z [548/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:06:26.9223175Z [549/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:06:26.9245079Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9247371Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9248488Z ^ 2025-05-07T20:06:26.9248735Z 2025-05-07T20:06:26.9249176Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:06:26.9250087Z 2025-05-07T20:06:26.9251678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9254122Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9255289Z ^ 2025-05-07T20:06:26.9255662Z 2025-05-07T20:06:26.9257218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9259883Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9261175Z ^ 2025-05-07T20:06:26.9261457Z 2025-05-07T20:06:26.9261884Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:06:26.9262519Z 2025-05-07T20:06:26.9264184Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9266853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9268027Z ^ 2025-05-07T20:06:26.9268384Z 2025-05-07T20:06:26.9270177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9272841Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9274143Z ^ 2025-05-07T20:06:26.9274382Z 2025-05-07T20:06:26.9274810Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:06:26.9275492Z 2025-05-07T20:06:26.9277295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9279866Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9280979Z ^ 2025-05-07T20:06:26.9281324Z 2025-05-07T20:06:26.9282910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9285409Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9286519Z ^ 
2025-05-07T20:06:26.9286754Z 2025-05-07T20:06:26.9287195Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:06:26.9287822Z 2025-05-07T20:06:26.9289371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9291842Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9292961Z ^ 2025-05-07T20:06:26.9293299Z 2025-05-07T20:06:26.9294988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9297526Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9298653Z ^ 2025-05-07T20:06:26.9298899Z 2025-05-07T20:06:26.9299322Z Remark: The warnings can be suppressed with "-diag-suppress <warning-number>" 2025-05-07T20:06:26.9299944Z 2025-05-07T20:06:26.9301547Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:26.9304072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:26.9305125Z ^ 2025-05-07T20:06:26.9305408Z 2025-05-07T20:06:27.4015501Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:06:27.9729102Z [551/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:27.9969354Z [552/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:27.9971664Z ################################################################################ 2025-05-07T20:06:27.9972310Z [CMAKE] Running post-build script ... 2025-05-07T20:06:27.9973335Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:06:27.9974396Z Removing all RPATHs ... 
2025-05-07T20:06:27.9974861Z ################################################################################ 2025-05-07T20:06:28.2612788Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda 
-O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:06:29.4715584Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:06:29.9653391Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:06:30.8122483Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:06:30.8143159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8145588Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8146625Z ^ 2025-05-07T20:06:30.8146889Z 2025-05-07T20:06:30.8147277Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.8147857Z 2025-05-07T20:06:30.8149157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8151603Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8152487Z ^ 2025-05-07T20:06:30.8152764Z 2025-05-07T20:06:30.8154091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8156768Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8157878Z ^ 2025-05-07T20:06:30.8158133Z 
2025-05-07T20:06:30.8158544Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.8159214Z 2025-05-07T20:06:30.8160716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8163220Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8164296Z ^ 2025-05-07T20:06:30.8164645Z 2025-05-07T20:06:30.8166154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8168589Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8169695Z ^ 2025-05-07T20:06:30.8169943Z 2025-05-07T20:06:30.8170386Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.8170986Z 2025-05-07T20:06:30.8172509Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8174980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8176553Z ^ 2025-05-07T20:06:30.8176908Z 2025-05-07T20:06:30.8178430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8197518Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8198596Z ^ 2025-05-07T20:06:30.8198831Z 2025-05-07T20:06:30.8199259Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.8199844Z 2025-05-07T20:06:30.8201319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8203788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8204926Z ^ 2025-05-07T20:06:30.8205288Z 2025-05-07T20:06:30.8206724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8209151Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8210415Z ^ 2025-05-07T20:06:30.8210660Z 2025-05-07T20:06:30.8211048Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:30.8211604Z 2025-05-07T20:06:30.8212992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:30.8215714Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:30.8216837Z ^ 2025-05-07T20:06:30.8217177Z 2025-05-07T20:06:30.9826318Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc 
-forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ 
-D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:06:32.4212598Z [558/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:06:33.0174955Z [559/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:06:33.4984114Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:06:33.5004078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5005431Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:33.5006097Z ^ 2025-05-07T20:06:33.5006313Z 2025-05-07T20:06:33.5006685Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:33.5007326Z 2025-05-07T20:06:33.5008320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5009872Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:33.5010624Z ^ 2025-05-07T20:06:33.5010876Z 2025-05-07T20:06:33.5011866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5013344Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:33.5014141Z ^ 2025-05-07T20:06:33.5014400Z 2025-05-07T20:06:33.5015397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5016898Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:33.5017501Z ^ 2025-05-07T20:06:33.5020281Z 2025-05-07T20:06:33.5021382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5022742Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:33.5023661Z ^ 2025-05-07T20:06:33.5023940Z 2025-05-07T20:06:33.5024398Z Remark: The warnings can be suppressed 
with "-diag-suppress " 2025-05-07T20:06:33.5025081Z 2025-05-07T20:06:33.5025993Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5027531Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:33.5028273Z ^ 2025-05-07T20:06:33.5028527Z 2025-05-07T20:06:33.5029499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5031012Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:33.5031811Z ^ 2025-05-07T20:06:33.5032066Z 2025-05-07T20:06:33.5033040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5034386Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:33.5035004Z ^ 2025-05-07T20:06:33.5035272Z 2025-05-07T20:06:33.5036244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5037826Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:33.5038489Z ^ 2025-05-07T20:06:33.5038766Z 2025-05-07T20:06:33.5039225Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:33.5039913Z 2025-05-07T20:06:33.5040886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5042244Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:33.5043136Z ^ 2025-05-07T20:06:33.5043391Z 2025-05-07T20:06:33.5044332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" 
function 2025-05-07T20:06:33.5045644Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:33.5046379Z ^ 2025-05-07T20:06:33.5046653Z 2025-05-07T20:06:33.5047632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5049080Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:33.5049697Z ^ 2025-05-07T20:06:33.5049914Z 2025-05-07T20:06:33.5050903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5052485Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:33.5053075Z ^ 2025-05-07T20:06:33.5053282Z 2025-05-07T20:06:33.5053687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:33.5054330Z 2025-05-07T20:06:33.5055305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5056764Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:33.5057359Z ^ 2025-05-07T20:06:33.5057578Z 2025-05-07T20:06:33.5058666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5060284Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:33.5060975Z ^ 2025-05-07T20:06:33.5061179Z 2025-05-07T20:06:33.5061885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5063248Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:33.5063844Z ^ 2025-05-07T20:06:33.5064165Z 2025-05-07T20:06:33.5064949Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5066224Z __attribute__((global)) inline void _float_to_FP8rowwise_cuda_kernel( 2025-05-07T20:06:33.5066868Z ^ 2025-05-07T20:06:33.5067094Z 2025-05-07T20:06:33.5067469Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:33.5068049Z 2025-05-07T20:06:33.5068911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5070474Z __attribute__((global)) inline void _get_FP8_qparam_cuda_kernel( 2025-05-07T20:06:33.5071219Z ^ 2025-05-07T20:06:33.5071472Z 2025-05-07T20:06:33.5072464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5073707Z __attribute__((global)) inline void _compute_FP8_quantize_cuda_kernel( 2025-05-07T20:06:33.5074366Z ^ 2025-05-07T20:06:33.5074624Z 2025-05-07T20:06:33.5075621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:33.5077433Z __attribute__((global)) inline void _FP8rowwise_to_float_cuda_kernel( 2025-05-07T20:06:33.5078051Z ^ 2025-05-07T20:06:33.5078263Z 2025-05-07T20:06:35.2752597Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o 
CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:06:35.2769879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2771944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2772876Z ^ 2025-05-07T20:06:35.2773078Z 2025-05-07T20:06:35.2773423Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:35.2773934Z 2025-05-07T20:06:35.2775116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2777320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2778178Z ^ 2025-05-07T20:06:35.2778472Z 2025-05-07T20:06:35.2779645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2781685Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2782548Z ^ 2025-05-07T20:06:35.2782744Z 2025-05-07T20:06:35.2783281Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:35.2783773Z 2025-05-07T20:06:35.2784996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that 
is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2786928Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2787819Z ^ 2025-05-07T20:06:35.2788106Z 2025-05-07T20:06:35.2789249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2791173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2792053Z ^ 2025-05-07T20:06:35.2792243Z 2025-05-07T20:06:35.2792558Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:35.2793046Z 2025-05-07T20:06:35.2794232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2796131Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2796996Z ^ 2025-05-07T20:06:35.2797258Z 2025-05-07T20:06:35.2798574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2800532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2801399Z ^ 2025-05-07T20:06:35.2801587Z 2025-05-07T20:06:35.2802019Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:35.2802502Z 2025-05-07T20:06:35.2803703Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2805638Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2806518Z ^ 2025-05-07T20:06:35.2806801Z 2025-05-07T20:06:35.2807955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2809881Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2810785Z ^ 2025-05-07T20:06:35.2811004Z 2025-05-07T20:06:35.2811339Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:35.2811863Z 2025-05-07T20:06:35.2813122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:35.2815074Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:35.2816166Z ^ 2025-05-07T20:06:35.2816472Z 2025-05-07T20:06:35.8874798Z [562/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:06:35.9224139Z [563/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:06:35.9225879Z ################################################################################ 2025-05-07T20:06:35.9226373Z [CMAKE] Running post-build script ... 2025-05-07T20:06:35.9227150Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:06:35.9227989Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:06:35.9228494Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:06:35.9229081Z ################################################################################ 2025-05-07T20:06:38.3431695Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:06:40.5058114Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:06:40.5069408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5070267Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:40.5070693Z ^ 2025-05-07T20:06:40.5070830Z 2025-05-07T20:06:40.5071091Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5071437Z 2025-05-07T20:06:40.5071960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 
2025-05-07T20:06:40.5072787Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:40.5073155Z ^ 2025-05-07T20:06:40.5073299Z 2025-05-07T20:06:40.5073819Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5074655Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:40.5075144Z ^ 2025-05-07T20:06:40.5075276Z 2025-05-07T20:06:40.5075809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5076928Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:40.5077339Z ^ 2025-05-07T20:06:40.5077464Z 2025-05-07T20:06:40.5078052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5078926Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:40.5079377Z ^ 2025-05-07T20:06:40.5079505Z 2025-05-07T20:06:40.5080022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5080871Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:40.5081288Z ^ 2025-05-07T20:06:40.5081415Z 2025-05-07T20:06:40.5081646Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5081994Z 2025-05-07T20:06:40.5082523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5083327Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:40.5083706Z ^ 
2025-05-07T20:06:40.5083832Z 2025-05-07T20:06:40.5084366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5085193Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:40.5085600Z ^ 2025-05-07T20:06:40.5085727Z 2025-05-07T20:06:40.5086254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5087102Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:40.5087564Z ^ 2025-05-07T20:06:40.5087691Z 2025-05-07T20:06:40.5088209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5089097Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:40.5089538Z ^ 2025-05-07T20:06:40.5089684Z 2025-05-07T20:06:40.5090198Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5091049Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:40.5091452Z ^ 2025-05-07T20:06:40.5091579Z 2025-05-07T20:06:40.5091822Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5092166Z 2025-05-07T20:06:40.5092680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5093499Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:40.5093879Z ^ 2025-05-07T20:06:40.5094005Z 2025-05-07T20:06:40.5094525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning 
#20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5095366Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:40.5095756Z ^ 2025-05-07T20:06:40.5095894Z 2025-05-07T20:06:40.5096459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5097311Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:40.5097714Z ^ 2025-05-07T20:06:40.5097924Z 2025-05-07T20:06:40.5098456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5099370Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:40.5099826Z ^ 2025-05-07T20:06:40.5099953Z 2025-05-07T20:06:40.5100593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5101432Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:40.5101847Z ^ 2025-05-07T20:06:40.5101975Z 2025-05-07T20:06:40.5102209Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5102564Z 2025-05-07T20:06:40.5103080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5103905Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:40.5104268Z ^ 2025-05-07T20:06:40.5104395Z 2025-05-07T20:06:40.5104930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5105756Z __attribute__((global)) inline void 
_compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:40.5106156Z ^ 2025-05-07T20:06:40.5106280Z 2025-05-07T20:06:40.5106797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5107650Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:40.5108060Z ^ 2025-05-07T20:06:40.5108185Z 2025-05-07T20:06:40.5108701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5109629Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:40.5110067Z ^ 2025-05-07T20:06:40.5110208Z 2025-05-07T20:06:40.5110725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5111573Z __attribute__((global)) inline void _float_to_fused8bitrowwise_cuda_kernel( 2025-05-07T20:06:40.5111973Z ^ 2025-05-07T20:06:40.5112113Z 2025-05-07T20:06:40.5112343Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:40.5112685Z 2025-05-07T20:06:40.5113203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5114023Z __attribute__((global)) inline void _get_8bit_qparam_cuda_kernel( 2025-05-07T20:06:40.5114404Z ^ 2025-05-07T20:06:40.5114533Z 2025-05-07T20:06:40.5115052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5115893Z __attribute__((global)) inline void _compute_8bit_quantize_cuda_kernel( 2025-05-07T20:06:40.5116288Z ^ 2025-05-07T20:06:40.5116427Z 2025-05-07T20:06:40.5116943Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5117826Z __attribute__((global)) inline void _fused8bitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:40.5118231Z ^ 2025-05-07T20:06:40.5118370Z 2025-05-07T20:06:40.5118889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:40.5119793Z __attribute__((global)) inline void _fused8bitrowwise_to_float_mixed_dim_cuda_kernel( 2025-05-07T20:06:40.5120243Z ^ 2025-05-07T20:06:40.5120368Z 2025-05-07T20:06:41.6745853Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:06:43.3395963Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include 
-isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:06:43.3417036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3419326Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3420292Z ^ 2025-05-07T20:06:43.3420593Z 2025-05-07T20:06:43.3421010Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3421599Z 2025-05-07T20:06:43.3423024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3425738Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3426809Z ^ 2025-05-07T20:06:43.3427174Z 2025-05-07T20:06:43.3428675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3431195Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3432292Z ^ 2025-05-07T20:06:43.3432557Z 2025-05-07T20:06:43.3432977Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3433573Z 2025-05-07T20:06:43.3435072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3437473Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3438612Z ^ 
2025-05-07T20:06:43.3438954Z 2025-05-07T20:06:43.3440487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3443185Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3444292Z ^ 2025-05-07T20:06:43.3444635Z 2025-05-07T20:06:43.3445055Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3445671Z 2025-05-07T20:06:43.3447274Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3449598Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3450680Z ^ 2025-05-07T20:06:43.3451009Z 2025-05-07T20:06:43.3452470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3454909Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3455963Z ^ 2025-05-07T20:06:43.3456204Z 2025-05-07T20:06:43.3456612Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3457206Z 2025-05-07T20:06:43.3458737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:06:43.3461285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3462409Z ^ 2025-05-07T20:06:43.3462768Z 2025-05-07T20:06:43.3464147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3466658Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3467741Z ^ 2025-05-07T20:06:43.3467980Z 2025-05-07T20:06:43.3468410Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:43.3469003Z 2025-05-07T20:06:43.3470580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:06:43.3472951Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:06:43.3474038Z ^ 2025-05-07T20:06:43.3474376Z 2025-05-07T20:06:45.6151019Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:06:45.6169745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 
2025-05-07T20:06:45.6171253Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:45.6171979Z ^ 2025-05-07T20:06:45.6172194Z 2025-05-07T20:06:45.6172597Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:45.6173177Z 2025-05-07T20:06:45.6174073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6175714Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:45.6176625Z ^ 2025-05-07T20:06:45.6176842Z 2025-05-07T20:06:45.6177762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6179159Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:45.6179814Z ^ 2025-05-07T20:06:45.6180029Z 2025-05-07T20:06:45.6181045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6182556Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:45.6183269Z ^ 2025-05-07T20:06:45.6183476Z 2025-05-07T20:06:45.6184375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6185777Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:45.6186463Z ^ 2025-05-07T20:06:45.6186699Z 2025-05-07T20:06:45.6187548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6189007Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:45.6189714Z ^ 
2025-05-07T20:06:45.6189945Z 2025-05-07T20:06:45.6190328Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:45.6190795Z 2025-05-07T20:06:45.6191794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6193161Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:45.6193957Z ^ 2025-05-07T20:06:45.6194154Z 2025-05-07T20:06:45.6195067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6196677Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:45.6197334Z ^ 2025-05-07T20:06:45.6197548Z 2025-05-07T20:06:45.6198449Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6199975Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:45.6200697Z ^ 2025-05-07T20:06:45.6200921Z 2025-05-07T20:06:45.6201800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6203093Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:45.6203692Z ^ 2025-05-07T20:06:45.6203850Z 2025-05-07T20:06:45.6204504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6205663Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:45.6206307Z ^ 2025-05-07T20:06:45.6206470Z 2025-05-07T20:06:45.6206840Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:45.6207473Z 
2025-05-07T20:06:45.6208116Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6209260Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:45.6209849Z ^ 2025-05-07T20:06:45.6210214Z 2025-05-07T20:06:45.6211098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6212352Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:45.6212887Z ^ 2025-05-07T20:06:45.6213065Z 2025-05-07T20:06:45.6213948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6215325Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:45.6215923Z ^ 2025-05-07T20:06:45.6216125Z 2025-05-07T20:06:45.6217057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6218624Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:45.6219266Z ^ 2025-05-07T20:06:45.6219470Z 2025-05-07T20:06:45.6220322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6221695Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:45.6222364Z ^ 2025-05-07T20:06:45.6222580Z 2025-05-07T20:06:45.6222949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:45.6223512Z 2025-05-07T20:06:45.6224444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for 
"__global__" function 2025-05-07T20:06:45.6225991Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:45.6226615Z ^ 2025-05-07T20:06:45.6226819Z 2025-05-07T20:06:45.6227716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6229287Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:45.6229928Z ^ 2025-05-07T20:06:45.6230122Z 2025-05-07T20:06:45.6231131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6232666Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:45.6233392Z ^ 2025-05-07T20:06:45.6233617Z 2025-05-07T20:06:45.6234538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6236093Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:45.6236768Z ^ 2025-05-07T20:06:45.6237006Z 2025-05-07T20:06:45.6237923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6239395Z __attribute__((global)) inline void _float_to_paddedFP8rowwise_cuda_kernel( 2025-05-07T20:06:45.6240066Z ^ 2025-05-07T20:06:45.6240306Z 2025-05-07T20:06:45.6240687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:45.6241263Z 2025-05-07T20:06:45.6242134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6243499Z __attribute__((global)) inline void _get_padding_value_kernel( 2025-05-07T20:06:45.6244110Z ^ 
2025-05-07T20:06:45.6244330Z 2025-05-07T20:06:45.6245229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6246762Z __attribute__((global)) inline void _single_thread_sum_padding_kernel( 2025-05-07T20:06:45.6247417Z ^ 2025-05-07T20:06:45.6247654Z 2025-05-07T20:06:45.6248528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6249955Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_1d_cuda_kernel( 2025-05-07T20:06:45.6250645Z ^ 2025-05-07T20:06:45.6250862Z 2025-05-07T20:06:45.6251775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:45.6253274Z __attribute__((global)) inline void _PaddedFP8rowwise_to_float_2d_cuda_kernel( 2025-05-07T20:06:45.6253990Z ^ 2025-05-07T20:06:45.6254206Z 2025-05-07T20:06:46.4446701Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:06:46.4466327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4467841Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:46.4468474Z ^ 2025-05-07T20:06:46.4468682Z 2025-05-07T20:06:46.4469080Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:06:46.4469647Z 2025-05-07T20:06:46.4470483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4472259Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:46.4472838Z ^ 2025-05-07T20:06:46.4473031Z 2025-05-07T20:06:46.4473770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4475063Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:46.4475705Z ^ 2025-05-07T20:06:46.4476177Z 2025-05-07T20:06:46.4476532Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4477058Z 2025-05-07T20:06:46.4477887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4479193Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:46.4479816Z ^ 2025-05-07T20:06:46.4480055Z 2025-05-07T20:06:46.4480857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4482024Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:46.4482634Z ^ 2025-05-07T20:06:46.4482841Z 2025-05-07T20:06:46.4483203Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4483791Z 2025-05-07T20:06:46.4484617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4486223Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:46.4486972Z ^ 2025-05-07T20:06:46.4487194Z 
2025-05-07T20:06:46.4488145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4489717Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:46.4490438Z ^ 2025-05-07T20:06:46.4490668Z 2025-05-07T20:06:46.4491188Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4491781Z 2025-05-07T20:06:46.4492720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4494265Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:46.4494964Z ^ 2025-05-07T20:06:46.4495184Z 2025-05-07T20:06:46.4496133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4497636Z __attribute__((global)) inline void _float_to_fusednbitrowwise_cuda_kernel( 2025-05-07T20:06:46.4498403Z ^ 2025-05-07T20:06:46.4498634Z 2025-05-07T20:06:46.4499085Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:46.4499719Z 2025-05-07T20:06:46.4500742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:46.4502303Z __attribute__((global)) inline void _fusednbitrowwise_to_float_cuda_kernel( 2025-05-07T20:06:46.4503040Z ^ 2025-05-07T20:06:46.4503285Z 2025-05-07T20:06:50.6074634Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu 
-o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:06:50.7166747Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC 
-Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:06:50.7178029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:50.7178884Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:50.7179347Z ^ 2025-05-07T20:06:50.7179495Z 2025-05-07T20:06:50.7179742Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.7180119Z 2025-05-07T20:06:50.7180710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:50.7181534Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:50.7181999Z ^ 2025-05-07T20:06:50.7182134Z 2025-05-07T20:06:50.7182395Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.7182744Z 2025-05-07T20:06:50.7183221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:50.7184053Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:50.7184480Z ^ 2025-05-07T20:06:50.7184638Z 2025-05-07T20:06:50.7184880Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.7185232Z 2025-05-07T20:06:50.7185820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" 
function 2025-05-07T20:06:50.7186632Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:50.7187132Z ^ 2025-05-07T20:06:50.7187267Z 2025-05-07T20:06:50.7187504Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.7187873Z 2025-05-07T20:06:50.7188393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:06:50.7189228Z __attribute__((global)) inline void _compute_msfp_shared_exponent_cuda_kernel( 2025-05-07T20:06:50.7189657Z ^ 2025-05-07T20:06:50.7189815Z 2025-05-07T20:06:50.7190052Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:06:50.7190397Z 2025-05-07T20:06:52.8910265Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:06:57.2626943Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:07:01.1597309Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL 
-DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:07:01.1608793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1610366Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1611022Z ^ 2025-05-07T20:07:01.1611172Z 2025-05-07T20:07:01.1611481Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1611840Z 2025-05-07T20:07:01.1612710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1614126Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1614769Z ^ 2025-05-07T20:07:01.1614994Z 2025-05-07T20:07:01.1615846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1617238Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1617864Z ^ 2025-05-07T20:07:01.1618029Z 2025-05-07T20:07:01.1618273Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:07:01.1618625Z 2025-05-07T20:07:01.1619511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1621002Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1621725Z ^ 2025-05-07T20:07:01.1621930Z 2025-05-07T20:07:01.1622802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1624181Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1624828Z ^ 2025-05-07T20:07:01.1624971Z 2025-05-07T20:07:01.1625217Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1625590Z 2025-05-07T20:07:01.1626448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1627854Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1628480Z ^ 2025-05-07T20:07:01.1628702Z 2025-05-07T20:07:01.1629553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1630978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1631586Z ^ 2025-05-07T20:07:01.1631732Z 2025-05-07T20:07:01.1631963Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1632305Z 2025-05-07T20:07:01.1633208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1634624Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1635250Z ^ 2025-05-07T20:07:01.1635444Z 2025-05-07T20:07:01.1636283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1637653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1638282Z ^ 2025-05-07T20:07:01.1638433Z 2025-05-07T20:07:01.1638674Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.1639049Z 2025-05-07T20:07:01.1639914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.1641295Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.1641931Z ^ 2025-05-07T20:07:01.1642137Z 2025-05-07T20:07:01.5529561Z [575/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE 
-Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:07:01.5541349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5542809Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5543430Z ^ 2025-05-07T20:07:01.5543571Z 2025-05-07T20:07:01.5543810Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.5544161Z 2025-05-07T20:07:01.5545037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5546416Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5547086Z ^ 2025-05-07T20:07:01.5547280Z 2025-05-07T20:07:01.5548150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5549525Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5550171Z ^ 2025-05-07T20:07:01.5550320Z 2025-05-07T20:07:01.5550590Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:07:01.5550946Z 2025-05-07T20:07:01.5551809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5553285Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5553924Z ^ 2025-05-07T20:07:01.5554155Z 2025-05-07T20:07:01.5554692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:01.5555499Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:01.5555841Z ^ 2025-05-07T20:07:01.5556033Z 2025-05-07T20:07:01.5556889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5558259Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5558875Z ^ 2025-05-07T20:07:01.5559011Z 2025-05-07T20:07:01.5559258Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.5559604Z 2025-05-07T20:07:01.5560456Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5561831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5562495Z ^ 2025-05-07T20:07:01.5562691Z 2025-05-07T20:07:01.5563218Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:01.5564028Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:01.5564345Z ^ 2025-05-07T20:07:01.5564519Z 2025-05-07T20:07:01.5565399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5566769Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5567372Z ^ 2025-05-07T20:07:01.5567522Z 2025-05-07T20:07:01.5567760Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.5568104Z 2025-05-07T20:07:01.5568975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5570343Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5570972Z ^ 2025-05-07T20:07:01.5571168Z 2025-05-07T20:07:01.5571704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:01.5572456Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:01.5572781Z ^ 2025-05-07T20:07:01.5572944Z 2025-05-07T20:07:01.5573785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5575152Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5575801Z ^ 2025-05-07T20:07:01.5576148Z 2025-05-07T20:07:01.5576382Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:01.5576738Z 2025-05-07T20:07:01.5577594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:01.5578969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:01.5579585Z ^ 2025-05-07T20:07:01.5579779Z 2025-05-07T20:07:01.5580320Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:07:01.5581155Z static constexpr uint32_t kMaxThreads = 1024; 2025-05-07T20:07:01.5581483Z ^ 2025-05-07T20:07:01.5581648Z 2025-05-07T20:07:02.4349548Z [576/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:07:02.4360940Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4362312Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4363018Z ^ 2025-05-07T20:07:02.4363198Z 2025-05-07T20:07:02.4363451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.4363798Z 2025-05-07T20:07:02.4364656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4366039Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4366671Z ^ 2025-05-07T20:07:02.4366868Z 2025-05-07T20:07:02.4367711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4369087Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4369703Z ^ 2025-05-07T20:07:02.4369843Z 2025-05-07T20:07:02.4370078Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.4370425Z 2025-05-07T20:07:02.4371290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4372652Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4373338Z ^ 2025-05-07T20:07:02.4373544Z 2025-05-07T20:07:02.4374403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4375808Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4376705Z ^ 2025-05-07T20:07:02.4376845Z 2025-05-07T20:07:02.4377093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.4377439Z 2025-05-07T20:07:02.4378298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4379676Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4380302Z ^ 2025-05-07T20:07:02.4380582Z 2025-05-07T20:07:02.4381423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4382794Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4383399Z ^ 2025-05-07T20:07:02.4383550Z 2025-05-07T20:07:02.4383787Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.4384130Z 2025-05-07T20:07:02.4385005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4386372Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4387089Z ^ 
2025-05-07T20:07:02.4387287Z 2025-05-07T20:07:02.4388144Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4389510Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4390130Z ^ 2025-05-07T20:07:02.4390268Z 2025-05-07T20:07:02.4390501Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:02.4390884Z 2025-05-07T20:07:02.4391737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:02.4393120Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:02.4393733Z ^ 2025-05-07T20:07:02.4393946Z 2025-05-07T20:07:04.7414571Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:07:04.7426218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7427692Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7428302Z ^ 2025-05-07T20:07:04.7428454Z 2025-05-07T20:07:04.7428743Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.7429090Z 2025-05-07T20:07:04.7429959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7431329Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7431963Z ^ 2025-05-07T20:07:04.7432158Z 2025-05-07T20:07:04.7433010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7434367Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7434984Z ^ 2025-05-07T20:07:04.7435123Z 2025-05-07T20:07:04.7435368Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.7435712Z 2025-05-07T20:07:04.7436563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7437976Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7438592Z ^ 2025-05-07T20:07:04.7438834Z 2025-05-07T20:07:04.7439676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7441092Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7441697Z ^ 2025-05-07T20:07:04.7441845Z 2025-05-07T20:07:04.7442075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.7442421Z 2025-05-07T20:07:04.7443286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7444653Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7445286Z ^ 2025-05-07T20:07:04.7445481Z 2025-05-07T20:07:04.7446337Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7447691Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7448303Z ^ 2025-05-07T20:07:04.7448439Z 2025-05-07T20:07:04.7448672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.7449033Z 2025-05-07T20:07:04.7449882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7451298Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:07:04.7451909Z ^ 2025-05-07T20:07:04.7452121Z 2025-05-07T20:07:04.7452973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7454368Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7454988Z ^ 2025-05-07T20:07:04.7455153Z 2025-05-07T20:07:04.7455394Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:04.7455747Z 2025-05-07T20:07:04.7456606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:04.7458020Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:04.7458676Z ^ 2025-05-07T20:07:04.7458876Z 2025-05-07T20:07:06.1558774Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:07:06.1571170Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1572731Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1573368Z ^ 2025-05-07T20:07:06.1573543Z 2025-05-07T20:07:06.1573799Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:06.1574154Z 2025-05-07T20:07:06.1575044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1576635Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1577299Z ^ 2025-05-07T20:07:06.1577510Z 2025-05-07T20:07:06.1578384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1579761Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1580476Z ^ 2025-05-07T20:07:06.1580633Z 2025-05-07T20:07:06.1580903Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:06.1581255Z 2025-05-07T20:07:06.1582189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1583602Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1584278Z ^ 2025-05-07T20:07:06.1584506Z 2025-05-07T20:07:06.1585410Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1586802Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1587424Z ^ 2025-05-07T20:07:06.1587593Z 2025-05-07T20:07:06.1587841Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:06.1588193Z 2025-05-07T20:07:06.1589072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1590457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1591108Z ^ 2025-05-07T20:07:06.1591311Z 2025-05-07T20:07:06.1592188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1593558Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1594205Z ^ 2025-05-07T20:07:06.1594352Z 2025-05-07T20:07:06.1594596Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:06.1594974Z 2025-05-07T20:07:06.1595832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1597287Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:07:06.1597919Z ^ 2025-05-07T20:07:06.1598142Z 2025-05-07T20:07:06.1598995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1600387Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1601006Z ^ 2025-05-07T20:07:06.1601152Z 2025-05-07T20:07:06.1601421Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:06.1601778Z 2025-05-07T20:07:06.1602638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:06.1604042Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:06.1604688Z ^ 2025-05-07T20:07:06.1604893Z 2025-05-07T20:07:17.4072293Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:07:17.4084062Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4085464Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4086088Z ^ 2025-05-07T20:07:17.4086230Z 2025-05-07T20:07:17.4086470Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4086820Z 2025-05-07T20:07:17.4087698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4089072Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4089709Z ^ 2025-05-07T20:07:17.4089909Z 2025-05-07T20:07:17.4090778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4092134Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4092762Z ^ 2025-05-07T20:07:17.4092901Z 2025-05-07T20:07:17.4093214Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4093562Z 2025-05-07T20:07:17.4094417Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4095843Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4096462Z ^ 2025-05-07T20:07:17.4096707Z 2025-05-07T20:07:17.4097554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4098926Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4099535Z ^ 2025-05-07T20:07:17.4099687Z 2025-05-07T20:07:17.4099923Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4100268Z 2025-05-07T20:07:17.4101217Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4102597Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4103231Z ^ 2025-05-07T20:07:17.4103425Z 2025-05-07T20:07:17.4104279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4105629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4106242Z ^ 2025-05-07T20:07:17.4106378Z 2025-05-07T20:07:17.4106672Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4107026Z 2025-05-07T20:07:17.4107880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4109258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4109869Z ^ 
2025-05-07T20:07:17.4110072Z 2025-05-07T20:07:17.4110917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4112289Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4112890Z ^ 2025-05-07T20:07:17.4113037Z 2025-05-07T20:07:17.4113269Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4113613Z 2025-05-07T20:07:17.4114473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4115834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4116459Z ^ 2025-05-07T20:07:17.4116651Z 2025-05-07T20:07:17.4190792Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:07:17.4201926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4203317Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4203937Z ^ 2025-05-07T20:07:17.4204078Z 2025-05-07T20:07:17.4204409Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4204893Z 2025-05-07T20:07:17.4205755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4207148Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4207770Z ^ 2025-05-07T20:07:17.4207985Z 2025-05-07T20:07:17.4208833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4210202Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4210807Z ^ 2025-05-07T20:07:17.4211006Z 2025-05-07T20:07:17.4211245Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4211596Z 2025-05-07T20:07:17.4212465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4213931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4214563Z ^ 2025-05-07T20:07:17.4214758Z 2025-05-07T20:07:17.4215744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4217205Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4217819Z ^ 2025-05-07T20:07:17.4217957Z 2025-05-07T20:07:17.4218190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4218551Z 2025-05-07T20:07:17.4219408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4220864Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4221514Z ^ 2025-05-07T20:07:17.4221723Z 2025-05-07T20:07:17.4222577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4223945Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4224601Z ^ 2025-05-07T20:07:17.4224738Z 2025-05-07T20:07:17.4224987Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4225333Z 2025-05-07T20:07:17.4226186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4227564Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4228192Z ^ 2025-05-07T20:07:17.4228387Z 2025-05-07T20:07:17.4229229Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4230601Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4231216Z ^ 2025-05-07T20:07:17.4231353Z 2025-05-07T20:07:17.4231588Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:17.4231943Z 2025-05-07T20:07:17.4232799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:17.4234173Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:17.4234819Z ^ 2025-05-07T20:07:17.4235016Z 2025-05-07T20:07:21.5814834Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:07:21.5826242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5827632Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5828238Z ^ 2025-05-07T20:07:21.5828390Z 
2025-05-07T20:07:21.5828634Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:21.5828980Z 2025-05-07T20:07:21.5829847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5831218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5831845Z ^ 2025-05-07T20:07:21.5832041Z 2025-05-07T20:07:21.5832881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5834319Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5834950Z ^ 2025-05-07T20:07:21.5835091Z 2025-05-07T20:07:21.5835326Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:21.5835688Z 2025-05-07T20:07:21.5836568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5837978Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5838590Z ^ 2025-05-07T20:07:21.5838800Z 2025-05-07T20:07:21.5839642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5841007Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5841611Z ^ 2025-05-07T20:07:21.5841797Z 2025-05-07T20:07:21.5842037Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:21.5842395Z 2025-05-07T20:07:21.5843276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5844661Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5845318Z ^ 2025-05-07T20:07:21.5845524Z 2025-05-07T20:07:21.5846401Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5847776Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5848458Z ^ 2025-05-07T20:07:21.5848607Z 2025-05-07T20:07:21.5848850Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:21.5849233Z 2025-05-07T20:07:21.5850099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5851511Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5852152Z ^ 2025-05-07T20:07:21.5852385Z 2025-05-07T20:07:21.5853238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5854642Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5855274Z ^ 2025-05-07T20:07:21.5855450Z 2025-05-07T20:07:21.5855695Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:21.5856049Z 2025-05-07T20:07:21.5856905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:21.5858344Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:21.5858999Z ^ 2025-05-07T20:07:21.5859205Z 2025-05-07T20:07:23.1139517Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:07:23.1151333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1152724Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1153354Z ^ 2025-05-07T20:07:23.1153496Z 2025-05-07T20:07:23.1153735Z Remark: The warnings can be suppressed with "-diag-suppress " 
2025-05-07T20:07:23.1154098Z 2025-05-07T20:07:23.1154954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1156345Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1156960Z ^ 2025-05-07T20:07:23.1157170Z 2025-05-07T20:07:23.1158010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1159493Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1160099Z ^ 2025-05-07T20:07:23.1160248Z 2025-05-07T20:07:23.1160485Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:23.1160861Z 2025-05-07T20:07:23.1161759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1163121Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1163743Z ^ 2025-05-07T20:07:23.1163942Z 2025-05-07T20:07:23.1164796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1166144Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1166760Z ^ 2025-05-07T20:07:23.1166895Z 2025-05-07T20:07:23.1167126Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:23.1167480Z 2025-05-07T20:07:23.1168334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1169704Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1170315Z ^ 2025-05-07T20:07:23.1170524Z 2025-05-07T20:07:23.1171365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1172784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1173384Z ^ 2025-05-07T20:07:23.1173530Z 2025-05-07T20:07:23.1173762Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:23.1174108Z 2025-05-07T20:07:23.1174972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1176498Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1177129Z ^ 2025-05-07T20:07:23.1177323Z 2025-05-07T20:07:23.1178162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1179532Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1180144Z ^ 2025-05-07T20:07:23.1180282Z 2025-05-07T20:07:23.1180583Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:23.1180941Z 2025-05-07T20:07:23.1181851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:23.1183225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:23.1183835Z ^ 2025-05-07T20:07:23.1184082Z 2025-05-07T20:07:24.8358609Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:07:24.8369780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8371169Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8371796Z ^ 2025-05-07T20:07:24.8371936Z 2025-05-07T20:07:24.8372207Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:24.8372576Z 2025-05-07T20:07:24.8373435Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8374815Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8375431Z ^ 2025-05-07T20:07:24.8375640Z 2025-05-07T20:07:24.8376747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8378129Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8378796Z ^ 2025-05-07T20:07:24.8378936Z 2025-05-07T20:07:24.8379186Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:24.8379530Z 2025-05-07T20:07:24.8380517Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8381899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8382523Z ^ 2025-05-07T20:07:24.8382719Z 2025-05-07T20:07:24.8383564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8384931Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8385545Z ^ 
2025-05-07T20:07:24.8385681Z 2025-05-07T20:07:24.8385918Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:24.8386267Z 2025-05-07T20:07:24.8387129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8388496Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8389126Z ^ 2025-05-07T20:07:24.8389321Z 2025-05-07T20:07:24.8390173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8392944Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8393569Z ^ 2025-05-07T20:07:24.8393708Z 2025-05-07T20:07:24.8393951Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:24.8394299Z 2025-05-07T20:07:24.8403296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8404805Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8405432Z ^ 2025-05-07T20:07:24.8405664Z 2025-05-07T20:07:24.8406515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:07:24.8407896Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8408502Z ^ 2025-05-07T20:07:24.8408641Z 2025-05-07T20:07:24.8408895Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:24.8409243Z 2025-05-07T20:07:24.8410184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:24.8411571Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:24.8412286Z ^ 2025-05-07T20:07:24.8412484Z 2025-05-07T20:07:27.9624530Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:07:27.9635821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9637208Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9637841Z ^ 2025-05-07T20:07:27.9637982Z 2025-05-07T20:07:27.9638222Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:27.9638570Z 2025-05-07T20:07:27.9639446Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9640815Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9641446Z ^ 2025-05-07T20:07:27.9641643Z 2025-05-07T20:07:27.9642611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9644043Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9644647Z ^ 2025-05-07T20:07:27.9644785Z 2025-05-07T20:07:27.9645090Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:27.9645436Z 2025-05-07T20:07:27.9646287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9647666Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9648298Z ^ 2025-05-07T20:07:27.9648493Z 2025-05-07T20:07:27.9649332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9650706Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9651322Z ^ 
2025-05-07T20:07:27.9651460Z 2025-05-07T20:07:27.9651692Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:27.9652049Z 2025-05-07T20:07:27.9652896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9654273Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9654889Z ^ 2025-05-07T20:07:27.9655121Z 2025-05-07T20:07:27.9655970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9657320Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9657933Z ^ 2025-05-07T20:07:27.9658070Z 2025-05-07T20:07:27.9658317Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:27.9658659Z 2025-05-07T20:07:27.9659511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9660980Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9661616Z ^ 2025-05-07T20:07:27.9661812Z 2025-05-07T20:07:27.9662660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:07:27.9664026Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9664625Z ^ 2025-05-07T20:07:27.9664775Z 2025-05-07T20:07:27.9665007Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:27.9665389Z 2025-05-07T20:07:27.9666255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:27.9667656Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:27.9668286Z ^ 2025-05-07T20:07:27.9668510Z 2025-05-07T20:07:29.3572636Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:07:30.7423546Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:07:30.7434697Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7436088Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7436698Z ^ 2025-05-07T20:07:30.7436855Z 2025-05-07T20:07:30.7437097Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:30.7437450Z 2025-05-07T20:07:30.7438309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7439770Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7440403Z ^ 2025-05-07T20:07:30.7440605Z 2025-05-07T20:07:30.7441451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7442849Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7443466Z ^ 2025-05-07T20:07:30.7443607Z 2025-05-07T20:07:30.7443842Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:30.7444204Z 2025-05-07T20:07:30.7445057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7446448Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7447064Z ^ 2025-05-07T20:07:30.7447274Z 2025-05-07T20:07:30.7448115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7449524Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7450137Z ^ 2025-05-07T20:07:30.7450294Z 2025-05-07T20:07:30.7450528Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:30.7450908Z 2025-05-07T20:07:30.7451793Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7453175Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7453801Z ^ 2025-05-07T20:07:30.7453994Z 2025-05-07T20:07:30.7454836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7456200Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7456817Z ^ 2025-05-07T20:07:30.7456957Z 2025-05-07T20:07:30.7457190Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:30.7457549Z 2025-05-07T20:07:30.7458403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7459778Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7460456Z ^ 2025-05-07T20:07:30.7460667Z 2025-05-07T20:07:30.7461510Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7462913Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7463530Z ^ 2025-05-07T20:07:30.7463667Z 2025-05-07T20:07:30.7463915Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:30.7464259Z 2025-05-07T20:07:30.7465110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:30.7466491Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:30.7467118Z ^ 2025-05-07T20:07:30.7467312Z 2025-05-07T20:07:31.5559443Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:07:31.5570713Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5572138Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5572747Z ^ 2025-05-07T20:07:31.5572905Z 2025-05-07T20:07:31.5573145Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:31.5573494Z 2025-05-07T20:07:31.5574362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5575803Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5576617Z ^ 2025-05-07T20:07:31.5576819Z 2025-05-07T20:07:31.5577660Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5579037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5579656Z ^ 2025-05-07T20:07:31.5579793Z 2025-05-07T20:07:31.5580028Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:31.5580499Z 2025-05-07T20:07:31.5581362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5582742Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5583361Z ^ 2025-05-07T20:07:31.5583572Z 2025-05-07T20:07:31.5584478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5585847Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5586502Z ^ 2025-05-07T20:07:31.5586642Z 2025-05-07T20:07:31.5586889Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:31.5587234Z 2025-05-07T20:07:31.5588126Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5589509Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5590138Z ^ 2025-05-07T20:07:31.5590334Z 2025-05-07T20:07:31.5591180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5592555Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5593174Z ^ 2025-05-07T20:07:31.5593312Z 2025-05-07T20:07:31.5593549Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:31.5593904Z 2025-05-07T20:07:31.5594752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is 
ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5596133Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5596747Z ^ 2025-05-07T20:07:31.5596943Z 2025-05-07T20:07:31.5597797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5599203Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5599826Z ^ 2025-05-07T20:07:31.5599962Z 2025-05-07T20:07:31.5600210Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:31.5600555Z 2025-05-07T20:07:31.5601405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:31.5602784Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:31.5603409Z ^ 2025-05-07T20:07:31.5603608Z 2025-05-07T20:07:32.5793556Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:07:32.5804543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5805915Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5806537Z ^ 2025-05-07T20:07:32.5806728Z 2025-05-07T20:07:32.5806971Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:32.5807322Z 2025-05-07T20:07:32.5808253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5809640Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5810271Z ^ 2025-05-07T20:07:32.5810481Z 2025-05-07T20:07:32.5811322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5812693Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5813308Z ^ 2025-05-07T20:07:32.5813450Z 2025-05-07T20:07:32.5813687Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:32.5814047Z 2025-05-07T20:07:32.5814906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5816279Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:07:32.5816892Z ^ 2025-05-07T20:07:32.5817085Z 2025-05-07T20:07:32.5817975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5819334Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5819985Z ^ 2025-05-07T20:07:32.5820123Z 2025-05-07T20:07:32.5820433Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:32.5820786Z 2025-05-07T20:07:32.5821673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5823052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5823678Z ^ 2025-05-07T20:07:32.5823873Z 2025-05-07T20:07:32.5824715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5826091Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5826687Z ^ 2025-05-07T20:07:32.5826835Z 2025-05-07T20:07:32.5827070Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:32.5827412Z 2025-05-07T20:07:32.5828271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its 
first declaration 2025-05-07T20:07:32.5829633Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5830254Z ^ 2025-05-07T20:07:32.5830446Z 2025-05-07T20:07:32.5831292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5832742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5833353Z ^ 2025-05-07T20:07:32.5833490Z 2025-05-07T20:07:32.5833721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:32.5834076Z 2025-05-07T20:07:32.5834926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:32.5836299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:32.5836914Z ^ 2025-05-07T20:07:32.5837119Z 2025-05-07T20:07:33.0742871Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 
-I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:07:33.0753971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on 
its first declaration 2025-05-07T20:07:33.0755362Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0755972Z ^ 2025-05-07T20:07:33.0756124Z 2025-05-07T20:07:33.0756362Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.0756776Z 2025-05-07T20:07:33.0757685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0759055Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0759683Z ^ 2025-05-07T20:07:33.0759879Z 2025-05-07T20:07:33.0760742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0762101Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0762723Z ^ 2025-05-07T20:07:33.0762860Z 2025-05-07T20:07:33.0763093Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.0763448Z 2025-05-07T20:07:33.0764305Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0765684Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0766300Z ^ 2025-05-07T20:07:33.0766505Z 2025-05-07T20:07:33.0767386Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0768789Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0769392Z ^ 2025-05-07T20:07:33.0769542Z 2025-05-07T20:07:33.0769810Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.0770155Z 2025-05-07T20:07:33.0771007Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0772381Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0773004Z ^ 2025-05-07T20:07:33.0773197Z 2025-05-07T20:07:33.0774036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0775410Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0776298Z ^ 2025-05-07T20:07:33.0776434Z 2025-05-07T20:07:33.0776668Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.0777027Z 2025-05-07T20:07:33.0777886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0779262Z __attribute__((host)) __attribute__((device)) 
inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0779878Z ^ 2025-05-07T20:07:33.0780139Z 2025-05-07T20:07:33.0781053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0782414Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0783028Z ^ 2025-05-07T20:07:33.0783164Z 2025-05-07T20:07:33.0783410Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.0783753Z 2025-05-07T20:07:33.0784608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.0785986Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.0786617Z ^ 2025-05-07T20:07:33.0786812Z 2025-05-07T20:07:33.2690214Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 
-I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:07:33.2701399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2702794Z __attribute__((host)) 
__attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2703454Z ^ 2025-05-07T20:07:33.2703689Z 2025-05-07T20:07:33.2703930Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.2704294Z 2025-05-07T20:07:33.2705151Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2706536Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2707152Z ^ 2025-05-07T20:07:33.2707362Z 2025-05-07T20:07:33.2708206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2709574Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2710185Z ^ 2025-05-07T20:07:33.2710322Z 2025-05-07T20:07:33.2710569Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.2710915Z 2025-05-07T20:07:33.2711768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2713253Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2713909Z ^ 2025-05-07T20:07:33.2714103Z 2025-05-07T20:07:33.2714919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: 
__host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2716284Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2716883Z ^ 2025-05-07T20:07:33.2717050Z 2025-05-07T20:07:33.2717279Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.2717615Z 2025-05-07T20:07:33.2718461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2719788Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2720396Z ^ 2025-05-07T20:07:33.2720587Z 2025-05-07T20:07:33.2721419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2722742Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2723340Z ^ 2025-05-07T20:07:33.2723473Z 2025-05-07T20:07:33.2723712Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.2724047Z 2025-05-07T20:07:33.2724877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2726218Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 
2025-05-07T20:07:33.2726882Z ^ 2025-05-07T20:07:33.2727071Z 2025-05-07T20:07:33.2727888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2729213Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2729794Z ^ 2025-05-07T20:07:33.2729940Z 2025-05-07T20:07:33.2730168Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.2730502Z 2025-05-07T20:07:33.2731348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.2732679Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.2733286Z ^ 2025-05-07T20:07:33.2733472Z 2025-05-07T20:07:33.8416519Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:07:33.8428150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8429563Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) 
constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8430266Z ^ 2025-05-07T20:07:33.8430415Z 2025-05-07T20:07:33.8430690Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.8431046Z 2025-05-07T20:07:33.8431913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8433427Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8434083Z ^ 2025-05-07T20:07:33.8434290Z 2025-05-07T20:07:33.8435122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8436484Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8437112Z ^ 2025-05-07T20:07:33.8437256Z 2025-05-07T20:07:33.8437497Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.8437857Z 2025-05-07T20:07:33.8438693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8440118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8440739Z ^ 2025-05-07T20:07:33.8440966Z 2025-05-07T20:07:33.8441796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a 
function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8443222Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8443830Z ^ 2025-05-07T20:07:33.8443998Z 2025-05-07T20:07:33.8444238Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.8444586Z 2025-05-07T20:07:33.8445428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8446800Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8447444Z ^ 2025-05-07T20:07:33.8447644Z 2025-05-07T20:07:33.8448473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8449831Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8450452Z ^ 2025-05-07T20:07:33.8450592Z 2025-05-07T20:07:33.8450827Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.8451188Z 2025-05-07T20:07:33.8452028Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8453391Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8454034Z ^ 
2025-05-07T20:07:33.8454230Z 2025-05-07T20:07:33.8455076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8456412Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8457038Z ^ 2025-05-07T20:07:33.8457178Z 2025-05-07T20:07:33.8457437Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:33.8457780Z 2025-05-07T20:07:33.8458617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:33.8459983Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:33.8460687Z ^ 2025-05-07T20:07:33.8461051Z 2025-05-07T20:07:34.5016793Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:07:34.5028442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5029909Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5030519Z ^ 2025-05-07T20:07:34.5030676Z 2025-05-07T20:07:34.5030949Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5031298Z 2025-05-07T20:07:34.5032169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5033629Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5034243Z ^ 2025-05-07T20:07:34.5034432Z 2025-05-07T20:07:34.5035266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5036586Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5037185Z ^ 2025-05-07T20:07:34.5037318Z 2025-05-07T20:07:34.5037546Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5037892Z 2025-05-07T20:07:34.5038757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5040098Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5040745Z ^ 2025-05-07T20:07:34.5040946Z 2025-05-07T20:07:34.5041791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5043118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5043705Z ^ 2025-05-07T20:07:34.5043850Z 2025-05-07T20:07:34.5044075Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5044409Z 2025-05-07T20:07:34.5045251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5046580Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5047196Z ^ 2025-05-07T20:07:34.5047384Z 2025-05-07T20:07:34.5048374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5049740Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5050351Z ^ 2025-05-07T20:07:34.5050486Z 2025-05-07T20:07:34.5050721Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5051074Z 2025-05-07T20:07:34.5051925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5053335Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5053944Z ^ 
2025-05-07T20:07:34.5054150Z 2025-05-07T20:07:34.5054989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5056357Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5056957Z ^ 2025-05-07T20:07:34.5057095Z 2025-05-07T20:07:34.5057347Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5057692Z 2025-05-07T20:07:34.5058548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5059924Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5060621Z ^ 2025-05-07T20:07:34.5060818Z 2025-05-07T20:07:34.5920970Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:07:34.5931922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5933377Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr 
StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5934015Z ^ 2025-05-07T20:07:34.5934165Z 2025-05-07T20:07:34.5934435Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5934805Z 2025-05-07T20:07:34.5935642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5937023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5937643Z ^ 2025-05-07T20:07:34.5937870Z 2025-05-07T20:07:34.5938700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5940058Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5940744Z ^ 2025-05-07T20:07:34.5941085Z 2025-05-07T20:07:34.5941331Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5941689Z 2025-05-07T20:07:34.5942625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5944014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5944714Z ^ 2025-05-07T20:07:34.5944918Z 2025-05-07T20:07:34.5945796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") 
that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5947192Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5947833Z ^ 2025-05-07T20:07:34.5947979Z 2025-05-07T20:07:34.5948221Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5948597Z 2025-05-07T20:07:34.5949453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5950853Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5951482Z ^ 2025-05-07T20:07:34.5951709Z 2025-05-07T20:07:34.5952561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5954037Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5954641Z ^ 2025-05-07T20:07:34.5954784Z 2025-05-07T20:07:34.5955043Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5955386Z 2025-05-07T20:07:34.5956225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5958670Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5959305Z ^ 2025-05-07T20:07:34.5959504Z 2025-05-07T20:07:34.5960333Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5961702Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5962324Z ^ 2025-05-07T20:07:34.5962465Z 2025-05-07T20:07:34.5962700Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:34.5963053Z 2025-05-07T20:07:34.5963913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:34.5965254Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:34.5965892Z ^ 2025-05-07T20:07:34.5966090Z 2025-05-07T20:07:35.8018206Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:07:35.8029797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8031225Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8031887Z ^ 
2025-05-07T20:07:35.8032043Z 2025-05-07T20:07:35.8032295Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:35.8032679Z 2025-05-07T20:07:35.8033640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8035021Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8035650Z ^ 2025-05-07T20:07:35.8035876Z 2025-05-07T20:07:35.8036714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8038076Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8038688Z ^ 2025-05-07T20:07:35.8038836Z 2025-05-07T20:07:35.8039102Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:35.8039488Z 2025-05-07T20:07:35.8040327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8041758Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8042405Z ^ 2025-05-07T20:07:35.8042609Z 2025-05-07T20:07:35.8043497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 
2025-05-07T20:07:35.8044834Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8045463Z ^ 2025-05-07T20:07:35.8045607Z 2025-05-07T20:07:35.8045849Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:35.8046219Z 2025-05-07T20:07:35.8047057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8048442Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8049063Z ^ 2025-05-07T20:07:35.8049280Z 2025-05-07T20:07:35.8050102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8051457Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8052055Z ^ 2025-05-07T20:07:35.8052195Z 2025-05-07T20:07:35.8052451Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:35.8052848Z 2025-05-07T20:07:35.8053684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8055052Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8055686Z ^ 2025-05-07T20:07:35.8055884Z 2025-05-07T20:07:35.8056712Z 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8058065Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8058693Z ^ 2025-05-07T20:07:35.8058834Z 2025-05-07T20:07:35.8059071Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:07:35.8059432Z 2025-05-07T20:07:35.8060269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:07:35.8061885Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:07:35.8062514Z ^ 2025-05-07T20:07:35.8062718Z 2025-05-07T20:07:36.5192748Z [595/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o 
CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 
CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:07:36.5846179Z [596/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:07:36.5847497Z ################################################################################ 2025-05-07T20:07:36.5847869Z [CMAKE] Running post-build script ... 2025-05-07T20:07:36.5848439Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:07:36.5849001Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:07:36.5849409Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:07:36.5849859Z ################################################################################ 2025-05-07T20:09:10.6986901Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe 
--diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:09:10.6999075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7000364Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7000972Z ^ 2025-05-07T20:09:10.7001117Z 2025-05-07T20:09:10.7001370Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:10.7001707Z 2025-05-07T20:09:10.7002511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7003823Z 
__attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7004433Z ^ 2025-05-07T20:09:10.7004628Z 2025-05-07T20:09:10.7005415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7006700Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7007273Z ^ 2025-05-07T20:09:10.7007436Z 2025-05-07T20:09:10.7007661Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:10.7007989Z 2025-05-07T20:09:10.7008805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7010118Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7010729Z ^ 2025-05-07T20:09:10.7010919Z 2025-05-07T20:09:10.7011725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7012994Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7013588Z ^ 2025-05-07T20:09:10.7013727Z 2025-05-07T20:09:10.7013973Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:10.7014302Z 2025-05-07T20:09:10.7015101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): 
warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7016397Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7016983Z ^ 2025-05-07T20:09:10.7017193Z 2025-05-07T20:09:10.7018014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7019313Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7019922Z ^ 2025-05-07T20:09:10.7020081Z 2025-05-07T20:09:10.7020365Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:10.7020859Z 2025-05-07T20:09:10.7021791Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7023176Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7023844Z ^ 2025-05-07T20:09:10.7024049Z 2025-05-07T20:09:10.7024930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7026306Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7026949Z ^ 2025-05-07T20:09:10.7027094Z 2025-05-07T20:09:10.7027339Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:09:10.7027712Z 2025-05-07T20:09:10.7028568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:10.7029969Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:10.7030595Z ^ 2025-05-07T20:09:10.7030816Z 2025-05-07T20:09:15.5642589Z [598/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode 
arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:09:15.5654839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5656136Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5656753Z ^ 2025-05-07T20:09:15.5656897Z 2025-05-07T20:09:15.5657155Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:15.5657487Z 2025-05-07T20:09:15.5658291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning 
#20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5659599Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5660209Z ^ 2025-05-07T20:09:15.5660482Z 2025-05-07T20:09:15.5661490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5662899Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5663591Z ^ 2025-05-07T20:09:15.5663764Z 2025-05-07T20:09:15.5664017Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:15.5664374Z 2025-05-07T20:09:15.5665438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5666934Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5667552Z ^ 2025-05-07T20:09:15.5667740Z 2025-05-07T20:09:15.5668545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5669816Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5670414Z ^ 2025-05-07T20:09:15.5670554Z 2025-05-07T20:09:15.5670797Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:09:15.5671127Z 2025-05-07T20:09:15.5671924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5673258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5673844Z ^ 2025-05-07T20:09:15.5674054Z 2025-05-07T20:09:15.5674881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5676556Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5677176Z ^ 2025-05-07T20:09:15.5677347Z 2025-05-07T20:09:15.5677587Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:15.5677942Z 2025-05-07T20:09:15.5678826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5680206Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5680867Z ^ 2025-05-07T20:09:15.5681070Z 2025-05-07T20:09:15.5681921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5683487Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5684092Z ^ 2025-05-07T20:09:15.5684231Z 2025-05-07T20:09:15.5684461Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:15.5684818Z 2025-05-07T20:09:15.5685610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:15.5686956Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:15.5687541Z ^ 2025-05-07T20:09:15.5687748Z 2025-05-07T20:09:17.9549762Z [599/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include -isystem 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/targets/x86_64-linux/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++20 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:09:17.9561544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9562875Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9563467Z ^ 2025-05-07T20:09:17.9563638Z 2025-05-07T20:09:17.9563873Z Remark: The warnings can be 
suppressed with "-diag-suppress " 2025-05-07T20:09:17.9564207Z 2025-05-07T20:09:17.9565032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9566324Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9566947Z ^ 2025-05-07T20:09:17.9567202Z 2025-05-07T20:09:17.9567995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9569299Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9569911Z ^ 2025-05-07T20:09:17.9570053Z 2025-05-07T20:09:17.9570283Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:17.9570640Z 2025-05-07T20:09:17.9571443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9572754Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9573357Z ^ 2025-05-07T20:09:17.9573577Z 2025-05-07T20:09:17.9574366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9575660Z __attribute__((host)) __attribute__((device)) inline 
__attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9576820Z ^ 2025-05-07T20:09:17.9576971Z 2025-05-07T20:09:17.9577300Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:17.9577656Z 2025-05-07T20:09:17.9578521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9580014Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9580802Z ^ 2025-05-07T20:09:17.9581010Z 2025-05-07T20:09:17.9581862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9583258Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9583908Z ^ 2025-05-07T20:09:17.9584058Z 2025-05-07T20:09:17.9584303Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:17.9584683Z 2025-05-07T20:09:17.9585550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9586962Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9587582Z ^ 2025-05-07T20:09:17.9587799Z 2025-05-07T20:09:17.9588642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on 
a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9590023Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9590640Z ^ 2025-05-07T20:09:17.9590801Z 2025-05-07T20:09:17.9591087Z Remark: The warnings can be suppressed with "-diag-suppress " 2025-05-07T20:09:17.9591435Z 2025-05-07T20:09:17.9592314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:09:17.9593745Z __attribute__((host)) __attribute__((device)) inline __attribute__((always_inline)) constexpr StochasticRoundingRNGState() = default; 2025-05-07T20:09:17.9594344Z ^ 2025-05-07T20:09:17.9594532Z 2025-05-07T20:09:19.5530458Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win 
-L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:09:20.1434932Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib 
-L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so 
/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:09:20.1556582Z [602/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed 
-Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs" -L"/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib" && : 2025-05-07T20:09:20.1989670Z [603/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:09:20.1991044Z ################################################################################ 2025-05-07T20:09:20.1991412Z [CMAKE] Running post-build script ... 2025-05-07T20:09:20.1992042Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:20.1992956Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:09:20.1993333Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:20.1993767Z ################################################################################ 2025-05-07T20:09:20.2800636Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:09:20.2802015Z ################################################################################ 2025-05-07T20:09:20.2802428Z [CMAKE] Running post-build script ... 2025-05-07T20:09:20.2803084Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:20.2803747Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:20.2804156Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:20.2804590Z ################################################################################ 2025-05-07T20:09:20.3246208Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:09:20.3247592Z ################################################################################ 2025-05-07T20:09:20.3247959Z [CMAKE] Running post-build script ... 2025-05-07T20:09:20.3248816Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:20.3249576Z Resetting RPATH to $ORIGIN ... 
2025-05-07T20:09:20.3249982Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:20.3250537Z ################################################################################ 2025-05-07T20:09:20.4376112Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib -L/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 
CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/torch/lib/libc10.so 
/github/home/miniconda/envs/build_binary/targets/x86_64-linux/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && : 2025-05-07T20:09:20.7390088Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:09:20.7391924Z ################################################################################ 2025-05-07T20:09:20.7392304Z [CMAKE] Running post-build script ... 2025-05-07T20:09:20.7393069Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:20.7393720Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:09:20.7394140Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:09:20.7394585Z ################################################################################ 2025-05-07T20:09:20.7395644Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.9/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:09:20.7430019Z -- Install configuration: "Release" 2025-05-07T20:09:20.7430687Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:09:20.7456454Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:09:20.7457436Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:09:20.7470782Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:09:20.7471916Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:09:20.7487593Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:09:20.7505316Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:09:20.7506329Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:09:20.7507742Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:09:20.7534485Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:09:20.7536669Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:09:20.7537730Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:09:26.9808388Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:09:28.1099652Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:09:30.7269120Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:09:31.1922393Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:09:31.1923503Z -- Installing: 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:09:31.1924624Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:09:31.1926100Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:09:31.1927331Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:09:31.1928629Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:09:31.1929886Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:09:31.1931096Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:09:31.1932379Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:09:31.1933685Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:09:31.1934962Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:09:31.1936249Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 
2025-05-07T20:09:31.1937573Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:09:31.1938808Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:09:31.1940000Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:09:31.1941315Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:09:31.1942728Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:09:31.1944022Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:09:31.1945106Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:09:31.1951755Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:09:31.1999878Z 2025-05-07T20:09:31.2057540Z 2025-05-07T20:09:31.2058167Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:09:31.2059080Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:09:31.2059983Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:09:31.2060810Z copying fbgemm_gpu/metrics.py -> 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:09:31.2061749Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:09:31.2062934Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:09:31.2064150Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:09:31.2064989Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:09:31.2065829Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:09:31.2066701Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:09:31.2067625Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:09:31.2068724Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:09:31.2069838Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:09:31.2070839Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:09:31.2071861Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:09:31.2073066Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 
2025-05-07T20:09:31.2074379Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:09:31.2075667Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:09:31.2077223Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:09:31.2078529Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:09:31.2079659Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:09:31.2080456Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:09:31.2081099Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config 2025-05-07T20:09:31.2081802Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:09:31.2082683Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:09:31.2083436Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs 2025-05-07T20:09:31.2084114Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:09:31.2084902Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:09:31.2085693Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:09:31.2086585Z copying 
fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:09:31.2087620Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:09:31.2088725Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:09:31.2089760Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:09:31.2090668Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:09:31.2091481Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:09:31.2092237Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:09:31.2092962Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:09:31.2093922Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:09:31.2094692Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll 2025-05-07T20:09:31.2095358Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:09:31.2096038Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:09:31.2096690Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:09:31.2097371Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton 2025-05-07T20:09:31.2098087Z copying fbgemm_gpu/triton/__init__.py -> 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:09:31.2098890Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:09:31.2099729Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:09:31.2100678Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:09:31.2101447Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils 2025-05-07T20:09:31.2102155Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:09:31.2102972Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:09:31.2103807Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:09:31.2105040Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:09:31.2105805Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:09:31.2106536Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:09:31.2107360Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:09:31.2108089Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:09:31.2108805Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:09:31.2109679Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:09:31.2110438Z creating directory 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2111186Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:09:31.2112078Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:09:31.2113187Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:09:31.2114476Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:09:31.2115679Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:09:31.2116806Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:09:31.2118141Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:09:31.2119634Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:09:31.2121069Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:09:31.2122443Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:09:31.2123868Z copying 
fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:09:31.2125165Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:09:31.2126468Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:09:31.2127506Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2128247Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:09:31.2129173Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:09:31.2130124Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:09:31.2131051Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:09:31.2132110Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:09:31.2133231Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:09:31.2134214Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:09:31.2135188Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:09:31.2136283Z copying 
fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:09:31.2137494Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:09:31.2138578Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:09:31.2139340Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:09:31.2140310Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:09:31.2141433Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:09:31.2142431Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:31.2143207Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:09:31.2144062Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:09:31.2145017Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:09:31.2146000Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:09:31.2146777Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:09:31.2147584Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:09:31.2148473Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:09:31.2149414Z 
copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:09:31.2150365Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:09:31.2151298Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:09:31.2152117Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:09:31.2152887Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:09:31.2153918Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:09:31.2154848Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:31.2155661Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:09:31.2156854Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:09:31.2157922Z creating directory _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:09:31.2158768Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:09:31.2159884Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:09:31.2160557Z 2025-05-07T20:09:31.2271452Z INFO:root:running bdist_wheel 2025-05-07T20:09:31.2319103Z INFO:root:running build 2025-05-07T20:09:31.2319841Z INFO:root:running build_py 2025-05-07T20:09:31.2323939Z INFO:root:creating 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2325763Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2327800Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2329250Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2330735Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2332551Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2334506Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2336180Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2337838Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2339221Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2340926Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sparse_ops.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2343197Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2344706Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2346337Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2347788Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2349488Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2351096Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2352923Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2355626Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2358450Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2360170Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2361610Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2363205Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2365532Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:31.2366730Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:31.2368408Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:31.2371252Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2372338Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2374049Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2375584Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2377557Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2379222Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2381149Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2382636Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2384007Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2385859Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:31.2387999Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:31.2389140Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:31.2391082Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:31.2392751Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll 2025-05-07T20:09:31.2394008Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/__init__.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll 2025-05-07T20:09:31.2396149Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe 2025-05-07T20:09:31.2397368Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe 2025-05-07T20:09:31.2399609Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:31.2400760Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:31.2402552Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:31.2404100Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:31.2405997Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:31.2408236Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:31.2409438Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:31.2411159Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:31.2412623Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/loader.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:31.2414168Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:31.2416336Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:31.2417491Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:31.2419135Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:31.2421685Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:31.2422866Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:31.2424979Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:31.2428160Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2440066Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2441798Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2443361Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2445016Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2446564Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2448129Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2449771Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2451452Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2453206Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2454881Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2457033Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2458688Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2460437Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:31.2461698Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2462850Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2464294Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2465722Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2467161Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2468663Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2470224Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 
2025-05-07T20:09:31.2471720Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2473181Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2474653Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2476379Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2477878Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:31.2478979Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:31.2480111Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:31.2481664Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:31.2482834Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:31.2483993Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 
2025-05-07T20:09:31.2485408Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:31.2486777Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:31.2488192Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:31.2489316Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:31.2490426Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:31.2491848Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:31.2493280Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:31.2494695Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:31.2496145Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:31.2497532Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:31.2498710Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:31.2500419Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:31.2502753Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:31.2503930Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:31.2505682Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:31.2507729Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:31.2508896Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:31.2510524Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:31.2563597Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2597964Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.2822435Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:31.3981655Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:34.8219684Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:34.8224121Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:34.9510122Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:34.9625190Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:34.9853457Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:35.0537326Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:37.7595725Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:37.8414080Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:45.1429210Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:46.2731368Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:48.9819967Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:49.4478763Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:49.4842490Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:49.7562357Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7564056Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7566653Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7572840Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7579184Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7585768Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7597031Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7601832Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7611425Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7617692Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7623979Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7635375Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7639708Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7649557Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7654769Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:49.7659006Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:49.7660636Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:49.7666539Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:49.7671359Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:49.7722195Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3437568Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3438958Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3440311Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3441564Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3442889Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3444397Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3445796Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3447061Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3448410Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3450040Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3452231Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3453924Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3455472Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3457058Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3458647Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3460279Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3462228Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3465171Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3468182Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3470019Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3471708Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3472970Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu 2025-05-07T20:09:50.3474493Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:50.3476346Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config 2025-05-07T20:09:50.3478196Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3479811Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3481849Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3483471Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3485384Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3487324Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3489328Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3490922Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3492691Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs 2025-05-07T20:09:50.3494428Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:50.3497846Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize 2025-05-07T20:09:50.3499361Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/__init__.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll 2025-05-07T20:09:50.3501234Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe 2025-05-07T20:09:50.3503048Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:50.3504768Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:50.3506303Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:50.3508118Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton 2025-05-07T20:09:50.3509720Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:50.3511414Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:50.3512972Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:50.3514529Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils 2025-05-07T20:09:50.3516287Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:50.3517820Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu 2025-05-07T20:09:50.3519665Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:50.3521304Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta 2025-05-07T20:09:50.3523014Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3525160Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3526882Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3528637Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3530215Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3531793Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3533563Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3535272Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3536995Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3538662Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3540573Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3542193Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3544259Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3545926Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3547710Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3549275Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3551164Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3552798Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3554387Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3557881Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3559467Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3561455Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3563184Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3564755Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3566412Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:50.3568114Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache 2025-05-07T20:09:50.3569625Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3571240Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3572870Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3574608Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3577388Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3579012Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3580609Z 
INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3582340Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3583980Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3586117Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:50.3587821Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats 2025-05-07T20:09:50.3589493Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:50.3591222Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:50.3592806Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:50.3594513Z INFO:root:copying _skbuild/linux-x86_64-3.9/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged 2025-05-07T20:09:50.3614652Z INFO:skbuild:copied 90 files 
2025-05-07T20:09:50.3615102Z INFO:root:running build_ext 2025-05-07T20:09:50.3616974Z INFO:root:installing to _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:50.3617480Z INFO:root:running install 2025-05-07T20:09:50.3670576Z INFO:root:running install_lib 2025-05-07T20:09:50.3671955Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:09:50.3673823Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:09:50.3674894Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:09:50.3676225Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:50.3677934Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:09:50.3679131Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:09:50.3680426Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3681978Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3683509Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3685052Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3686678Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3688363Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3689953Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3691498Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3693030Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:09:50.3694226Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:09:50.3695443Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:50.3697067Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:09:50.3698278Z INFO:root:creating 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:09:50.3699067Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:09:50.3700243Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:50.3701858Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:09:50.3703054Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:09:50.3704253Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:50.3705843Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:09:50.3707046Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3708297Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3709909Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3711635Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3713493Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3715221Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3716981Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3718805Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3720670Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3722562Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3724462Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3726340Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3728138Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3729962Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:09:50.3731645Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:09:50.3732735Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:09:50.3733527Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3734731Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3736342Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3738099Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/bench_runs.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3739717Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3741430Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3743156Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3744813Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3746432Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3748122Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3749860Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3751550Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/bench/utils.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:09:50.3752755Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:09:50.3753964Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:50.3755612Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:09:50.3756889Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3757707Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:50.3758949Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:50.3760724Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:09:50.3762478Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3764011Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3765626Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3767250Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:09:50.3768424Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3769632Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3771199Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3772813Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3774430Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3776212Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:09:50.3777406Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:09:50.3778616Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats/__init__.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:50.3780403Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:09:50.3781982Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:09:50.3783112Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:09:50.3783911Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:09:50.3785172Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:50.3786922Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:09:50.3788570Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:50.3790148Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:50.3791706Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 
2025-05-07T20:09:50.3793399Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:09:50.3794580Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:09:50.3795703Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:50.3797183Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:50.3798695Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:50.3800211Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:09:50.3801643Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.3803023Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.3819902Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.3947732Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.6614185Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.6615789Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.6718181Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.6725602Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.6746103Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.6805170Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.8884389Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:50.8946936Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.4502896Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.5373107Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.7381981Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.7739163Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.7769831Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.7980518Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7982247Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7984486Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7986654Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7988978Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7991080Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7993212Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7995371Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7997612Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.7999859Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.8002072Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.8004484Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.8006636Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.8008716Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.8010846Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:09:51.8012393Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:51.8014049Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:51.8016217Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:09:51.8018082Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8019641Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8454185Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8455822Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8457373Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8458812Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8460393Z INFO:root:copying 
_skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8462263Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8463865Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8465440Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8466976Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8468449Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8469932Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8471553Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8473195Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_optimizer_ops.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8474772Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8476501Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8478177Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8479975Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8481664Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8483405Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8485150Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8486721Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/tbe_input_multiplexer.py -> 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8488186Z INFO:root:copying _skbuild/linux-x86_64-3.9/setuptools/lib.linux-x86_64-cpython-39/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:09:51.8489153Z INFO:skbuild:copied 125 files 2025-05-07T20:09:51.8489452Z INFO:root:running install_egg_info 2025-05-07T20:09:51.8522669Z INFO:root:running egg_info 2025-05-07T20:09:51.8551994Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:09:51.8556969Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:09:51.8559417Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:09:51.8560507Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:09:51.8662598Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:51.8703057Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:09:51.8704180Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.9.egg-info 2025-05-07T20:09:51.8710801Z INFO:root:running install_scripts 2025-05-07T20:09:51.8711376Z INFO:skbuild:copied 0 files 2025-05-07T20:09:54.6162193Z INFO:root:creating _skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:09:54.6163742Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-sth8qixf/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:09:54.6167306Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:09:54.6426217Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:09:54.6442831Z INFO:wheel:adding 
'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:09:54.6443685Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:09:54.8462886Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:09:54.8599890Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:09:54.8731429Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:09:56.5764371Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:09:56.7783955Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:09:57.4841348Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:09:57.5909339Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:09:58.1816768Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:10:15.9543086Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:10:17.1853200Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:10:44.1879502Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:10:46.9890755Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:10:50.5924011Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:10:51.2837603Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:10:51.5028651Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:11:00.0811287Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:11:10.9294092Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:11:12.3862450Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:11:12.4219038Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:11:12.4219643Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:11:12.4222564Z INFO:wheel:adding 
'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:11:12.4223893Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:11:12.4227027Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:11:12.4230257Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:11:12.4241199Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:11:12.4244952Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:11:12.4247936Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:11:12.4249646Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:11:12.4251200Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:11:12.4253365Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:11:12.4256506Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:11:12.4279237Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:11:12.4324533Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:11:12.4327836Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:11:12.4329460Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:11:12.4331305Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:11:12.4332993Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:11:12.4334899Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:11:12.4336910Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:11:12.4338965Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:11:12.4340528Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:11:12.4342624Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:11:12.4345219Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 
2025-05-07T20:11:12.4347126Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:11:12.4349470Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:11:12.4351196Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:11:12.4357125Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:11:12.4359152Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:11:12.4361068Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:11:12.4362973Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:11:12.4365212Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:11:12.4367359Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:11:12.4373751Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:11:12.4376623Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:11:12.4379477Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:11:12.4382349Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:11:12.4384037Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:11:12.4386053Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:11:12.4388631Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:11:12.4392370Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:11:12.4396448Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:11:12.4398632Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:11:12.4401044Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:11:12.4406677Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:11:12.4412175Z INFO:wheel:adding 
'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:11:12.4414480Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:11:12.4418359Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:11:12.4423974Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:11:12.4426926Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:11:12.4430027Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:11:12.4433802Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:11:12.4436092Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:11:12.4438256Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:11:12.4441286Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:11:12.4444629Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:11:12.4447657Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:11:12.4451027Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:11:12.4454303Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:11:12.4457471Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:11:12.4460945Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:11:12.4464553Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 
2025-05-07T20:11:12.4467527Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:11:12.4469696Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:11:12.4472423Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:11:12.4474204Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:11:12.4476415Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:11:12.4478849Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:11:12.4483877Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:11:12.4486564Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:11:12.4489032Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:11:12.4491082Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:11:12.4492731Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:11:12.4496066Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:11:12.4498909Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:11:12.4501775Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:11:12.4503680Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:11:12.4505543Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:11:12.4507316Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:11:12.4509043Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:11:12.4510532Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:11:12.4516511Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:11:12.4541541Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:11:12.4545477Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 
2025-05-07T20:11:12.4548458Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:11:12.4550279Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:11:12.4553156Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:11:12.4555103Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:11:12.4556797Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:11:12.4558647Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:11:12.4561261Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:11:12.4566903Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:11:12.4569225Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:11:12.4571182Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:11:12.4579298Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:11:12.4584283Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:11:12.4586338Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:11:12.4594367Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:11:12.4596843Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:11:12.4599233Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:11:12.4600971Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:11:12.4603349Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:11:12.4606358Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:11:12.4607403Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:11:12.4608325Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:11:12.4615090Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:11:12.4619599Z INFO:root:removing 
_skbuild/linux-x86_64-3.9/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:11:12.6268896Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:11:12.6269479Z │ │ Version │ 2025-05-07T20:11:12.6270040Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:11:12.6270565Z │ PyTorch │ 2.8.0.dev20250507+cu126 │ 2025-05-07T20:11:12.6271119Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:12.6271664Z │ CUDA (Declared by PyTorch) │ 12.6 │ 2025-05-07T20:11:12.6272247Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:11:12.6272772Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:11:12.6273295Z │ │ Copyright (c) 2005-2024 NVIDIA Corporation │ 2025-05-07T20:11:12.6273804Z │ │ Built on Tue_Oct_29_23:50:19_PDT_2024 │ 2025-05-07T20:11:12.6274270Z │ │ Cuda compilation tools, release 12.6, V12.6.85 │ 2025-05-07T20:11:12.6274759Z │ │ Build cuda_12.6.r12.6/compiler.35059454_0 │ 2025-05-07T20:11:12.6275472Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:11:12.9370609Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:13.0246986Z 2025-05-07T20:11:13.0398738Z ################################################################################ 2025-05-07T20:11:13.0399803Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:13.0400250Z [CHECK] Listing out library size: 2025-05-07T20:11:13.0400833Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:13.0401148Z 2025-05-07T20:11:13.0413546Z 1 ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:13.0413882Z 2025-05-07T20:11:13.0414291Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 
2025-05-07T20:11:13.0415162Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.0415717Z 2025-05-07T20:11:13.0478714Z GLIBC_2.2.5 2025-05-07T20:11:13.0479419Z GLIBC_2.14 2025-05-07T20:11:13.0479560Z 2025-05-07T20:11:13.0479564Z 2025-05-07T20:11:13.0479935Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:13.0480860Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.0481415Z 2025-05-07T20:11:13.0542523Z GLIBCXX_3.4 2025-05-07T20:11:13.0542954Z 2025-05-07T20:11:13.0542968Z 2025-05-07T20:11:13.0562521Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so > /tmp/tmp.Dm9GSaaMLz.symbols.txt 2025-05-07T20:11:13.0563767Z 2025-05-07T20:11:13.0604821Z 2025-05-07T20:11:13.0639875Z [CHECK] Total Number of symbols: 841 2025-05-07T20:11:13.0656058Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:11:13.0671359Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so > /tmp/tmp.dCx8Nnlkfl.usymbols.txt 2025-05-07T20:11:13.0671809Z 2025-05-07T20:11:13.0697436Z 2025-05-07T20:11:13.0725398Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:11:13.0742204Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.0742608Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.0742960Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.0743309Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.0743635Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:13.0743983Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.0744305Z U abort@GLIBC_2.2.5 2025-05-07T20:11:13.0744622Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:13.0744908Z U close@GLIBC_2.2.5 2025-05-07T20:11:13.0745210Z U fputs@GLIBC_2.2.5 2025-05-07T20:11:13.0745493Z U free@GLIBC_2.2.5 2025-05-07T20:11:13.0745899Z U 
ftruncate64@GLIBC_2.2.5 2025-05-07T20:11:13.0746197Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:13.0746495Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:13.0746801Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:11:13.0747104Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:13.0747403Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:13.0747685Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:13.0747978Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.0748258Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.0748556Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.0748834Z U mmap@GLIBC_2.2.5 2025-05-07T20:11:13.0749132Z U mprotect@GLIBC_2.2.5 2025-05-07T20:11:13.0749439Z U munmap@GLIBC_2.2.5 2025-05-07T20:11:13.0749721Z U open64@GLIBC_2.2.5 2025-05-07T20:11:13.0750057Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.0750427Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:11:13.0750817Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.0751168Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.0751524Z U read@GLIBC_2.2.5 2025-05-07T20:11:13.0751826Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:13.0752156Z U shm_open@GLIBC_2.2.5 2025-05-07T20:11:13.0752496Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:11:13.0752815Z U snprintf@GLIBC_2.2.5 2025-05-07T20:11:13.0753194Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.0753526Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:13.0753853Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:13.0754267Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.0754586Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:13.0754881Z U syscall@GLIBC_2.2.5 2025-05-07T20:11:13.0755308Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:13.0755618Z U uname@GLIBC_2.2.5 2025-05-07T20:11:13.0755898Z U unlink@GLIBC_2.2.5 2025-05-07T20:11:13.0756207Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:11:13.0756559Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.0756998Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.0757421Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.0757823Z w 
_ITM_deregisterTMCloneTable 2025-05-07T20:11:13.0758165Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.0758542Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.0758877Z w __gmon_start__ 2025-05-07T20:11:13.0759206Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.0759631Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:13.0759918Z 2025-05-07T20:11:13.0784739Z linux-vdso.so.1 (0x00007fff1f6b2000) 2025-05-07T20:11:13.0785756Z libtorch.so => not found 2025-05-07T20:11:13.0786516Z libc10.so => not found 2025-05-07T20:11:13.0787223Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.0788306Z libc10_cuda.so => not found 2025-05-07T20:11:13.0788940Z libnccl.so.2 => not found 2025-05-07T20:11:13.0789250Z libcuda.so.1 => not found 2025-05-07T20:11:13.0789528Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.0789852Z libtorch_cpu.so => not found 2025-05-07T20:11:13.0790136Z libtorch_cuda.so => not found 2025-05-07T20:11:13.0790507Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f628384f000) 2025-05-07T20:11:13.0790949Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f62837f9000) 2025-05-07T20:11:13.0791374Z librt.so.1 => /lib64/librt.so.1 (0x00007f62837f2000) 2025-05-07T20:11:13.0791801Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f62837c4000) 2025-05-07T20:11:13.0792244Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f62837bf000) 2025-05-07T20:11:13.0792762Z libc.so.6 => /lib64/libc.so.6 (0x00007f62835b7000) 2025-05-07T20:11:13.0793131Z libm.so.6 => /lib64/libm.so.6 (0x00007f62834dc000) 2025-05-07T20:11:13.0793535Z /lib64/ld-linux-x86-64.so.2 (0x00007f6283b30000) 2025-05-07T20:11:13.0793790Z 2025-05-07T20:11:13.0793913Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.0794484Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so 2025-05-07T20:11:13.0794774Z 2025-05-07T20:11:13.0825520Z 2025-05-07T20:11:13.0825853Z Dynamic section at offset 0x75898 contains 39 entries: 2025-05-07T20:11:13.0826270Z Tag 
Type Name/Value 2025-05-07T20:11:13.0826746Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.0827283Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.0827831Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.0828386Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.0828913Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.0829426Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.0829941Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.0830479Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.0830997Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.0831527Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.0832048Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:13.0832573Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:11:13.0833086Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.0833611Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:11:13.0834126Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.0834643Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:11:13.0835060Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:11:13.0835405Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:11:13.0835740Z 0x0000000000000019 (INIT_ARRAY) 0x74ac0 2025-05-07T20:11:13.0836103Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.0836449Z 0x000000000000001a (FINI_ARRAY) 0x74ac8 2025-05-07T20:11:13.0836805Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.0837312Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.0837649Z 0x0000000000000005 (STRTAB) 0x6980 
2025-05-07T20:11:13.0837997Z 0x0000000000000006 (SYMTAB) 0x1a90 2025-05-07T20:11:13.0838346Z 0x000000000000000a (STRSZ) 48828 (bytes) 2025-05-07T20:11:13.0838774Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.0839121Z 0x0000000000000003 (PLTGOT) 0x75fe8 2025-05-07T20:11:13.0839495Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:11:13.0839897Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.0840225Z 0x0000000000000017 (JMPREL) 0x162d8 2025-05-07T20:11:13.0840573Z 0x0000000000000007 (RELA) 0x12f90 2025-05-07T20:11:13.0840923Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:11:13.0841297Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.0841644Z 0x000000006ffffffe (VERNEED) 0x12ed0 2025-05-07T20:11:13.0842006Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.0842326Z 0x000000006ffffff0 (VERSYM) 0x1283c 2025-05-07T20:11:13.0842671Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:11:13.0842988Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.0843232Z 2025-05-07T20:11:13.0843349Z ################################################################################ 2025-05-07T20:11:13.0843578Z 2025-05-07T20:11:13.0843596Z 2025-05-07T20:11:13.0843713Z ################################################################################ 2025-05-07T20:11:13.0844183Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.0844653Z [CHECK] Listing out library size: 2025-05-07T20:11:13.0845094Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.0845444Z 2025-05-07T20:11:13.0845626Z 1 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.0845904Z 2025-05-07T20:11:13.0846282Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.0847212Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 
's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.0847795Z 2025-05-07T20:11:13.0897563Z GLIBC_2.2.5 2025-05-07T20:11:13.0898777Z GLIBC_2.14 2025-05-07T20:11:13.0899612Z 2025-05-07T20:11:13.0899738Z 2025-05-07T20:11:13.0900431Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.0901454Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.0902050Z 2025-05-07T20:11:13.0950727Z GLIBCXX_3.4 2025-05-07T20:11:13.0951395Z GLIBCXX_3.4.9 2025-05-07T20:11:13.0951999Z GLIBCXX_3.4.21 2025-05-07T20:11:13.0956458Z 2025-05-07T20:11:13.0956473Z 2025-05-07T20:11:13.0978477Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.guo1KV8J8m.symbols.txt 2025-05-07T20:11:13.0978943Z 2025-05-07T20:11:13.0999307Z 2025-05-07T20:11:13.1026727Z [CHECK] Total Number of symbols: 116 2025-05-07T20:11:13.1042183Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:13.1062914Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.YDY4BmLpsV.usymbols.txt 2025-05-07T20:11:13.1064323Z 2025-05-07T20:11:13.1079744Z 2025-05-07T20:11:13.1105716Z [CHECK] Listing out undefined symbols (55 total): 2025-05-07T20:11:13.1124525Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.1126315Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.1127277Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.1128202Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.1129277Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.1129617Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.1129978Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.1130337Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.1130760Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:11:13.1131127Z U __gxx_personality_v0@CXXABI_1.3 
2025-05-07T20:11:13.1131465Z U c10::BoolType::get() 2025-05-07T20:11:13.1131878Z U c10::StringType::get() 2025-05-07T20:11:13.1132223Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.1133003Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.1134306Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.1135062Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:13.1135377Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:13.1135660Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.1137048Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.1137359Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.1137664Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.1138036Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.1138434Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:13.1139095Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:13.1139908Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.1141014Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.1141681Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.1142084Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.1142545Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.1142979Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.1143392Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.1143922Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.1144883Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.1145739Z U 
std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.1146134Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.1146503Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.1146894Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.1147248Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.1147610Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.1147920Z U strtol@GLIBC_2.2.5 2025-05-07T20:11:13.1148283Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.1149146Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.1150416Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:13.1151477Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.1152205Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:11:13.1152640Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.1153190Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.1153637Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.1154239Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.1154897Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.1155320Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.1155641Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.1155938Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.1156233Z w __gmon_start__ 2025-05-07T20:11:13.1156519Z w __pthread_key_create@GLIBC_2.2.5 2025-05-07T20:11:13.1156884Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.1157307Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.1157583Z 2025-05-07T20:11:13.1174058Z linux-vdso.so.1 (0x00007ffcd3df3000) 2025-05-07T20:11:13.1175095Z libtorch.so => not found 
2025-05-07T20:11:13.1175815Z libc10.so => not found 2025-05-07T20:11:13.1176960Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.1177723Z libc10_cuda.so => not found 2025-05-07T20:11:13.1178462Z libnccl.so.2 => not found 2025-05-07T20:11:13.1179202Z libcuda.so.1 => not found 2025-05-07T20:11:13.1179932Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.1180945Z libtorch_cpu.so => not found 2025-05-07T20:11:13.1181359Z libtorch_cuda.so => not found 2025-05-07T20:11:13.1181719Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f4afe519000) 2025-05-07T20:11:13.1182151Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f4afe4c3000) 2025-05-07T20:11:13.1182575Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4afe493000) 2025-05-07T20:11:13.1183028Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f4afe48e000) 2025-05-07T20:11:13.1183437Z libc.so.6 => /lib64/libc.so.6 (0x00007f4afe286000) 2025-05-07T20:11:13.1183805Z libm.so.6 => /lib64/libm.so.6 (0x00007f4afe1ab000) 2025-05-07T20:11:13.1184168Z /lib64/ld-linux-x86-64.so.2 (0x00007f4afe78e000) 2025-05-07T20:11:13.1184418Z 2025-05-07T20:11:13.1184536Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.1184944Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:11:13.1185275Z 2025-05-07T20:11:13.1211072Z 2025-05-07T20:11:13.1211848Z Dynamic section at offset 0x8c98 contains 38 entries: 2025-05-07T20:11:13.1213003Z Tag Type Name/Value 2025-05-07T20:11:13.1214250Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.1215735Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.1217215Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.1218433Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.1218939Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.1219429Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.1219947Z 
0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.1220730Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.1221297Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.1221818Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.1222474Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:13.1223001Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.1223512Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:11:13.1224073Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.1224585Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:11:13.1225079Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:11:13.1225419Z 0x000000000000000d (FINI) 0x6f80 2025-05-07T20:11:13.1225743Z 0x0000000000000019 (INIT_ARRAY) 0x9bb0 2025-05-07T20:11:13.1226097Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:11:13.1226445Z 0x000000000000001a (FINI_ARRAY) 0x9bc0 2025-05-07T20:11:13.1226800Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.1227146Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.1227484Z 0x0000000000000005 (STRTAB) 0xed0 2025-05-07T20:11:13.1227801Z 0x0000000000000006 (SYMTAB) 0x3d8 2025-05-07T20:11:13.1228152Z 0x000000000000000a (STRSZ) 7794 (bytes) 2025-05-07T20:11:13.1228560Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.1228925Z 0x0000000000000003 (PLTGOT) 0x9fe8 2025-05-07T20:11:13.1229290Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:11:13.1229642Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.1229979Z 0x0000000000000017 (JMPREL) 0x33a0 2025-05-07T20:11:13.1230302Z 0x0000000000000007 (RELA) 0x2ef0 2025-05-07T20:11:13.1230658Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:11:13.1231026Z 0x0000000000000009 (RELAENT) 24 (bytes) 
2025-05-07T20:11:13.1231369Z 0x000000006ffffffe (VERNEED) 0x2e30 2025-05-07T20:11:13.1231714Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:13.1232037Z 0x000000006ffffff0 (VERSYM) 0x2d42 2025-05-07T20:11:13.1232373Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:11:13.1232679Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.1233007Z 2025-05-07T20:11:13.1233125Z ################################################################################ 2025-05-07T20:11:13.1233348Z 2025-05-07T20:11:13.1233352Z 2025-05-07T20:11:13.1233481Z ################################################################################ 2025-05-07T20:11:13.1233900Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:13.1234322Z [CHECK] Listing out library size: 2025-05-07T20:11:13.1234703Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:13.1235013Z 2025-05-07T20:11:13.1235141Z 6 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:13.1235372Z 2025-05-07T20:11:13.1235699Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:13.1236524Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.1237037Z 2025-05-07T20:11:13.1501530Z GLIBC_2.2.5 2025-05-07T20:11:13.1501775Z GLIBC_2.3 2025-05-07T20:11:13.1501982Z GLIBC_2.14 2025-05-07T20:11:13.1502098Z 2025-05-07T20:11:13.1502102Z 2025-05-07T20:11:13.1502474Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:13.1503377Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.1503941Z 2025-05-07T20:11:13.1767829Z GLIBCXX_3.4 2025-05-07T20:11:13.1768604Z GLIBCXX_3.4.9 2025-05-07T20:11:13.1768851Z GLIBCXX_3.4.11 
2025-05-07T20:11:13.1769072Z GLIBCXX_3.4.14 2025-05-07T20:11:13.1769272Z GLIBCXX_3.4.15 2025-05-07T20:11:13.1769633Z GLIBCXX_3.4.18 2025-05-07T20:11:13.1769844Z GLIBCXX_3.4.21 2025-05-07T20:11:13.1769972Z 2025-05-07T20:11:13.1769976Z 2025-05-07T20:11:13.1785123Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so > /tmp/tmp.nBOK7x6jDg.symbols.txt 2025-05-07T20:11:13.1786509Z 2025-05-07T20:11:13.2012090Z 2025-05-07T20:11:13.2036945Z [CHECK] Total Number of symbols: 4951 2025-05-07T20:11:13.2055461Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:11:13.2071724Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so > /tmp/tmp.bvWw7MypVF.usymbols.txt 2025-05-07T20:11:13.2072168Z 2025-05-07T20:11:13.2100378Z 2025-05-07T20:11:13.2124808Z [CHECK] Listing out undefined symbols (133 total): 2025-05-07T20:11:13.2138786Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.2139260Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.2139748Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.2140240Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.2140636Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.2140960Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:13.2141305Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.2141813Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.2142158Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.2142556Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:11:13.2142919Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.2143254Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:13.2143577Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.2143892Z U __extendhfsf2 2025-05-07T20:11:13.2144184Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.2144530Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:13.2144900Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.2145198Z U __truncsfhf2 2025-05-07T20:11:13.2145484Z U abort@GLIBC_2.2.5 2025-05-07T20:11:13.2145964Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned 
int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.2146875Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.2147905Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.2149019Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:11:13.2150073Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:11:13.2150818Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:13.2151365Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:11:13.2151953Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:11:13.2152563Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:11:13.2153040Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:11:13.2153559Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:11:13.2154274Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:11:13.2154916Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:11:13.2155342Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:11:13.2155919Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:11:13.2156487Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:11:13.2156931Z U 
asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:11:13.2157407Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:11:13.2157747Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:13.2158012Z U ceilf@GLIBC_2.2.5 2025-05-07T20:11:13.2158300Z U cpuinfo_get_packages 2025-05-07T20:11:13.2158592Z U cpuinfo_get_packages_count 2025-05-07T20:11:13.2158901Z U cpuinfo_initialize 2025-05-07T20:11:13.2159183Z U cpuinfo_isa 2025-05-07T20:11:13.2159437Z U floor@GLIBC_2.2.5 2025-05-07T20:11:13.2159718Z U fma@GLIBC_2.2.5 2025-05-07T20:11:13.2159974Z U fmaf@GLIBC_2.2.5 2025-05-07T20:11:13.2160243Z U free@GLIBC_2.2.5 2025-05-07T20:11:13.2160536Z U fwrite@GLIBC_2.2.5 2025-05-07T20:11:13.2160818Z U getenv@GLIBC_2.2.5 2025-05-07T20:11:13.2161083Z U ldexp@GLIBC_2.2.5 2025-05-07T20:11:13.2161357Z U log2@GLIBC_2.2.5 2025-05-07T20:11:13.2161622Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:13.2161901Z U lrintf@GLIBC_2.2.5 2025-05-07T20:11:13.2162185Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.2162452Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.2162739Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.2163010Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:11:13.2163307Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:11:13.2163611Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.2163951Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:11:13.2164283Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.2164636Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.2164982Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:11:13.2165270Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:13.2165666Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:13.2166134Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:13.2166577Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:13.2167213Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, 
std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:13.2167910Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.2168894Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:11:13.2170076Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.2170765Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.2171252Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:13.2171636Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:13.2172077Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:11:13.2172608Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:13.2173051Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:11:13.2173445Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:13.2173806Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:13.2174124Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.2174454Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:13.2174789Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:13.2175142Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:13.2175499Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:11:13.2175883Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.2176728Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.2177106Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.2177944Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.2178805Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:11:13.2179111Z U 
std::cout@GLIBCXX_3.4 2025-05-07T20:11:13.2179491Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:13.2179893Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:11:13.2180405Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:11:13.2180785Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.2181231Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.2181912Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:13.2182643Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:11:13.2183183Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:13.2183687Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.2184244Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.2184736Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:13.2185089Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.2185462Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:13.2185912Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:13.2186557Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.2187003Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:13.2187358Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.2187683Z U stderr@GLIBC_2.2.5 2025-05-07T20:11:13.2187970Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:13.2188270Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.2188550Z U strstr@GLIBC_2.2.5 2025-05-07T20:11:13.2188852Z U tolower@GLIBC_2.2.5 2025-05-07T20:11:13.2189145Z U toupper@GLIBC_2.2.5 2025-05-07T20:11:13.2189532Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:11:13.2189966Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:13.2190428Z U typeinfo for std::future_error@GLIBCXX_3.4.14 
2025-05-07T20:11:13.2190801Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:13.2191168Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.2191634Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.2192025Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:11:13.2192366Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:11:13.2192752Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.2193057Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.2193365Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.2193684Z w __gmon_start__ 2025-05-07T20:11:13.2193961Z w __pthread_key_create 2025-05-07T20:11:13.2194264Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.2194573Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.2194879Z w pthread_once 2025-05-07T20:11:13.2195132Z w pthread_rwlock_rdlock 2025-05-07T20:11:13.2195430Z w pthread_rwlock_unlock 2025-05-07T20:11:13.2195712Z w pthread_rwlock_wrlock 2025-05-07T20:11:13.2196011Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:11:13.2196342Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.2196729Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:13.2196998Z 2025-05-07T20:11:13.2197154Z linux-vdso.so.1 (0x00007ffce0be4000) 2025-05-07T20:11:13.2197429Z libc10.so => not found 2025-05-07T20:11:13.2197681Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.2197932Z libc10_cuda.so => not found 2025-05-07T20:11:13.2198202Z libnccl.so.2 => not found 2025-05-07T20:11:13.2198442Z libcuda.so.1 => not found 2025-05-07T20:11:13.2198957Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f0f861df000) 2025-05-07T20:11:13.2199507Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.2199766Z libtorch.so => not found 2025-05-07T20:11:13.2200021Z libtorch_cpu.so => not found 2025-05-07T20:11:13.2200280Z libtorch_cuda.so => not found 2025-05-07T20:11:13.2200609Z 
libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f0f8599c000) 2025-05-07T20:11:13.2200980Z libm.so.6 => /lib64/libm.so.6 (0x00007f0f858c1000) 2025-05-07T20:11:13.2201351Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0f861af000) 2025-05-07T20:11:13.2201709Z libc.so.6 => /lib64/libc.so.6 (0x00007f0f856b9000) 2025-05-07T20:11:13.2202062Z /lib64/ld-linux-x86-64.so.2 (0x00007f0f8625c000) 2025-05-07T20:11:13.2202382Z libtorch.so => not found 2025-05-07T20:11:13.2202615Z libc10.so => not found 2025-05-07T20:11:13.2202862Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.2203107Z libc10_cuda.so => not found 2025-05-07T20:11:13.2203364Z libnccl.so.2 => not found 2025-05-07T20:11:13.2203602Z libcuda.so.1 => not found 2025-05-07T20:11:13.2203855Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.2204112Z libtorch_cpu.so => not found 2025-05-07T20:11:13.2204381Z libtorch_cuda.so => not found 2025-05-07T20:11:13.2204697Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f0f85663000) 2025-05-07T20:11:13.2205061Z librt.so.1 => /lib64/librt.so.1 (0x00007f0f861a6000) 2025-05-07T20:11:13.2205459Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f0f861a1000) 2025-05-07T20:11:13.2205724Z 2025-05-07T20:11:13.2205832Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.2206196Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so 2025-05-07T20:11:13.2206457Z 2025-05-07T20:11:13.2221928Z 2025-05-07T20:11:13.2222611Z Dynamic section at offset 0x54d6c8 contains 40 entries: 2025-05-07T20:11:13.2223098Z Tag Type Name/Value 2025-05-07T20:11:13.2223569Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.2224140Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.2224676Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.2225232Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.2225889Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 
2025-05-07T20:11:13.2226432Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:11:13.2226967Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.2227595Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.2228143Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.2228728Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.2229286Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.2229803Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:13.2230333Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.2230842Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.2231399Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.2231949Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:11:13.2232445Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.2232959Z 0x000000000000000c (INIT) 0xff000 2025-05-07T20:11:13.2233307Z 0x000000000000000d (FINI) 0x4c1c58 2025-05-07T20:11:13.2233684Z 0x0000000000000019 (INIT_ARRAY) 0x54a1c0 2025-05-07T20:11:13.2234083Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:11:13.2234452Z 0x000000000000001a (FINI_ARRAY) 0x54a688 2025-05-07T20:11:13.2234836Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.2235193Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:13.2235563Z 0x0000000000000005 (STRTAB) 0x26de0 2025-05-07T20:11:13.2235912Z 0x0000000000000006 (SYMTAB) 0x9da0 2025-05-07T20:11:13.2236311Z 0x000000000000000a (STRSZ) 754246 (bytes) 2025-05-07T20:11:13.2236695Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.2237082Z 0x0000000000000003 (PLTGOT) 0x551fe8 2025-05-07T20:11:13.2237492Z 0x0000000000000002 (PLTRELSZ) 25992 (bytes) 2025-05-07T20:11:13.2237865Z 0x0000000000000014 
(PLTREL) RELA 2025-05-07T20:11:13.2238221Z 0x0000000000000017 (JMPREL) 0xf8458 2025-05-07T20:11:13.2238572Z 0x0000000000000007 (RELA) 0xe1838 2025-05-07T20:11:13.2238962Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:11:13.2239339Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.2239721Z 0x000000006ffffffe (VERNEED) 0xe16d8 2025-05-07T20:11:13.2240066Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.2240435Z 0x000000006ffffff0 (VERSYM) 0xdf026 2025-05-07T20:11:13.2240804Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:11:13.2241130Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.2241343Z 2025-05-07T20:11:13.2241492Z ################################################################################ 2025-05-07T20:11:13.2241731Z 2025-05-07T20:11:13.2241735Z 2025-05-07T20:11:13.2241866Z ################################################################################ 2025-05-07T20:11:13.2242397Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.2242916Z [CHECK] Listing out library size: 2025-05-07T20:11:13.2243377Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.2243749Z 2025-05-07T20:11:13.2243982Z 3 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.2244286Z 2025-05-07T20:11:13.2244673Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.2245720Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.2246308Z 2025-05-07T20:11:13.2299297Z GLIBC_2.2.5 2025-05-07T20:11:13.2299630Z GLIBC_2.14 2025-05-07T20:11:13.2299807Z 2025-05-07T20:11:13.2299943Z 2025-05-07T20:11:13.2300473Z [CHECK] Listing out the GLIBCXX versions referenced by: 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.2301600Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.2302213Z 2025-05-07T20:11:13.2363790Z GLIBCXX_3.4 2025-05-07T20:11:13.2364147Z GLIBCXX_3.4.9 2025-05-07T20:11:13.2364464Z GLIBCXX_3.4.14 2025-05-07T20:11:13.2364720Z GLIBCXX_3.4.20 2025-05-07T20:11:13.2364966Z GLIBCXX_3.4.21 2025-05-07T20:11:13.2365098Z 2025-05-07T20:11:13.2365103Z 2025-05-07T20:11:13.2384595Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.uxBYVoeUKR.symbols.txt 2025-05-07T20:11:13.2386003Z 2025-05-07T20:11:13.2421651Z 2025-05-07T20:11:13.2450863Z [CHECK] Total Number of symbols: 550 2025-05-07T20:11:13.2463102Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:11:13.2479453Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.vlBsBR8AdS.usymbols.txt 2025-05-07T20:11:13.2479977Z 2025-05-07T20:11:13.2502528Z 2025-05-07T20:11:13.2535868Z [CHECK] Listing out undefined symbols (179 total): 2025-05-07T20:11:13.2551803Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.2552458Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.2552813Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.2553224Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.2553607Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.2554007Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.2554380Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.2554751Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.2555132Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.2555516Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.2555857Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.2556176Z U 
__cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.2556511Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.2556829Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:13.2557165Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.2557489Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.2557824Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.2558151Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.2558456Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.2558791Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.2559297Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:13.2559899Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:13.2560371Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:13.2561315Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.2562248Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:11:13.2562699Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:13.2563353Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:13.2564028Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:13.2565195Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:13.2566110Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:13.2567022Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.2567910Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:13.2568240Z U at::get_num_threads() 2025-05-07T20:11:13.2568537Z U at::get_thread_num() 2025-05-07T20:11:13.2568829Z U at::internal::set_thread_num(int) 2025-05-07T20:11:13.2569173Z U 
at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:13.2569488Z U c10::BoolType::get() 2025-05-07T20:11:13.2569890Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.2570488Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.2571047Z U c10::Error::what() const 2025-05-07T20:11:13.2571392Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.2571801Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.2572218Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.2572542Z U c10::IntType::get() 2025-05-07T20:11:13.2572893Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.2573277Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.2573691Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.2574315Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:13.2574664Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:13.2575039Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:13.2575430Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.2576440Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.2577106Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.2577490Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:13.2577874Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.2578227Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:13.2578594Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:13.2578932Z U c10::SymIntType::get() 2025-05-07T20:11:13.2579286Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:13.2579660Z U c10::TensorType::get() 2025-05-07T20:11:13.2579987Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.2581047Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, 
std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.2582030Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.2582440Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:13.2583065Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:13.2583800Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:13.2584420Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.2584783Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.2585125Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.2585519Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.2585862Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.2586343Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.2586825Z U c10::cuda::device_count() 2025-05-07T20:11:13.2587171Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.2587573Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.2587967Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.2588377Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.2588839Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.2589222Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.2589972Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.2590849Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.2591727Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.2592687Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.2593813Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.2594617Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.2594957Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.2595288Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.2595642Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:13.2595982Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:13.2596356Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.2596738Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.2597132Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:13.2597510Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:13.2597851Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:13.2598193Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.2598594Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.2599033Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.2599499Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.2599832Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.2600340Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.2600673Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.2601009Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.2601344Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.2601737Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.2602071Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.2602413Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.2602754Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.2603157Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.2603519Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.2604533Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2606363Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2608017Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2609659Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2611462Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2613245Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2615040Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2616829Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2618628Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2620517Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2622334Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 
2025-05-07T20:11:13.2624148Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.2625332Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:11:13.2625785Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:13.2626267Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:13.2626824Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.2627232Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.2627643Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.2628060Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.2628490Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.2628921Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.2629335Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.2629703Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.2630004Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.2630293Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.2630599Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:11:13.2630913Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:11:13.2631255Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.2631600Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.2632187Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.2633124Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.2633696Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.2634047Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:13.2634408Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.2634784Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.2635188Z U std::__throw_out_of_range_fmt(char 
const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.2635672Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.2636546Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.2637310Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:13.2637639Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.2637978Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.2638301Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.2638627Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.2639004Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.2639491Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.2639941Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.2640446Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:11:13.2641296Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:13.2642341Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:11:13.2643008Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.2643287Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.2643578Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.2644357Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.2645432Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.2646409Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.2647146Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.2647715Z U typeinfo for c10::Error 
2025-05-07T20:11:13.2648109Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:13.2648517Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.2648955Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.2649359Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.2649796Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.2650154Z U vtable for c10::Error 2025-05-07T20:11:13.2650668Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.2651351Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.2651783Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.2652096Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.2652392Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.2652678Z w __gmon_start__ 2025-05-07T20:11:13.2652942Z w __pthread_key_create 2025-05-07T20:11:13.2653265Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.2653699Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.2654000Z 2025-05-07T20:11:13.2654148Z linux-vdso.so.1 (0x00007ffca49ea000) 2025-05-07T20:11:13.2654415Z libc10.so => not found 2025-05-07T20:11:13.2654660Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.2654904Z libc10_cuda.so => not found 2025-05-07T20:11:13.2655157Z libnccl.so.2 => not found 2025-05-07T20:11:13.2655390Z libcuda.so.1 => not found 2025-05-07T20:11:13.2655897Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f97fb600000) 2025-05-07T20:11:13.2656770Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f97fc078000) 2025-05-07T20:11:13.2657396Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.2657660Z libtorch.so => not found 2025-05-07T20:11:13.2657893Z libtorch_cpu.so => not found 2025-05-07T20:11:13.2658148Z libtorch_cuda.so => not 
found 2025-05-07T20:11:13.2658394Z libcudart.so.12 => not found 2025-05-07T20:11:13.2658711Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f97fb39c000) 2025-05-07T20:11:13.2659102Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f97fc020000) 2025-05-07T20:11:13.2659495Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f97fbff2000) 2025-05-07T20:11:13.2659857Z libc.so.6 => /lib64/libc.so.6 (0x00007f97fb194000) 2025-05-07T20:11:13.2660245Z libc10.so => not found 2025-05-07T20:11:13.2660656Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.2660909Z libc10_cuda.so => not found 2025-05-07T20:11:13.2661220Z libnccl.so.2 => not found 2025-05-07T20:11:13.2661465Z libcuda.so.1 => not found 2025-05-07T20:11:13.2661991Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f97fbf79000) 2025-05-07T20:11:13.2662551Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.2662825Z libtorch.so => not found 2025-05-07T20:11:13.2663070Z libtorch_cpu.so => not found 2025-05-07T20:11:13.2663333Z libtorch_cuda.so => not found 2025-05-07T20:11:13.2663667Z libm.so.6 => /lib64/libm.so.6 (0x00007f97fb0b9000) 2025-05-07T20:11:13.2664022Z /lib64/ld-linux-x86-64.so.2 (0x00007f97fc089000) 2025-05-07T20:11:13.2664350Z libtorch.so => not found 2025-05-07T20:11:13.2664588Z libc10.so => not found 2025-05-07T20:11:13.2664864Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.2665113Z libc10_cuda.so => not found 2025-05-07T20:11:13.2665368Z libnccl.so.2 => not found 2025-05-07T20:11:13.2665608Z libcuda.so.1 => not found 2025-05-07T20:11:13.2665862Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.2666147Z libtorch_cpu.so => not found 2025-05-07T20:11:13.2666414Z libtorch_cuda.so => not found 2025-05-07T20:11:13.2666758Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f97fbf6e000) 2025-05-07T20:11:13.2667125Z libtorch.so => not found 2025-05-07T20:11:13.2667369Z libc10.so => not found 2025-05-07T20:11:13.2667602Z libnvrtc.so.12 => not found 
2025-05-07T20:11:13.2667857Z libc10_cuda.so => not found 2025-05-07T20:11:13.2668105Z libnccl.so.2 => not found 2025-05-07T20:11:13.2668357Z libcuda.so.1 => not found 2025-05-07T20:11:13.2668598Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.2668864Z libtorch_cpu.so => not found 2025-05-07T20:11:13.2669124Z libtorch_cuda.so => not found 2025-05-07T20:11:13.2669426Z librt.so.1 => /lib64/librt.so.1 (0x00007f97fbf65000) 2025-05-07T20:11:13.2669694Z 2025-05-07T20:11:13.2669813Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.2670222Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:11:13.2670554Z 2025-05-07T20:11:13.2670559Z 2025-05-07T20:11:13.2670722Z Dynamic section at offset 0x2b5a90 contains 41 entries: 2025-05-07T20:11:13.2671087Z Tag Type Name/Value 2025-05-07T20:11:13.2671501Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.2671995Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.2672508Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.2673115Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.2673590Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.2674074Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:13.2674559Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:13.2675070Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.2675570Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.2676193Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.2676891Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.2677406Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.2677933Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 
2025-05-07T20:11:13.2678444Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:13.2678965Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.2679477Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.2679999Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:13.2680531Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.2680935Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:11:13.2681282Z 0x000000000000000d (FINI) 0x6243c 2025-05-07T20:11:13.2681617Z 0x0000000000000019 (INIT_ARRAY) 0x2b4a40 2025-05-07T20:11:13.2681981Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:11:13.2682331Z 0x000000000000001a (FINI_ARRAY) 0x2b4a88 2025-05-07T20:11:13.2682693Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.2683137Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.2683468Z 0x0000000000000005 (STRTAB) 0x40a0 2025-05-07T20:11:13.2683812Z 0x0000000000000006 (SYMTAB) 0xcf8 2025-05-07T20:11:13.2684156Z 0x000000000000000a (STRSZ) 48233 (bytes) 2025-05-07T20:11:13.2685929Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.2686281Z 0x0000000000000003 (PLTGOT) 0x2b5fe8 2025-05-07T20:11:13.2686657Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:11:13.2687262Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.2687591Z 0x0000000000000017 (JMPREL) 0x13a68 2025-05-07T20:11:13.2687937Z 0x0000000000000007 (RELA) 0x10258 2025-05-07T20:11:13.2688285Z 0x0000000000000008 (RELASZ) 14352 (bytes) 2025-05-07T20:11:13.2688656Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.2689109Z 0x000000006ffffffe (VERNEED) 0x10158 2025-05-07T20:11:13.2689451Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.2689764Z 0x000000006ffffff0 (VERSYM) 0xfd0a 2025-05-07T20:11:13.2690097Z 0x000000006ffffff9 (RELACOUNT) 337 2025-05-07T20:11:13.2690407Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.2690643Z 
2025-05-07T20:11:13.2690757Z ################################################################################ 2025-05-07T20:11:13.2690982Z 2025-05-07T20:11:13.2690999Z 2025-05-07T20:11:13.2691116Z ################################################################################ 2025-05-07T20:11:13.2691582Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.2692052Z [CHECK] Listing out library size: 2025-05-07T20:11:13.2692492Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.2692939Z 2025-05-07T20:11:13.2693130Z 21 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.2693413Z 2025-05-07T20:11:13.2693758Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.2694638Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.2695184Z 2025-05-07T20:11:13.2740461Z GLIBC_2.2.5 2025-05-07T20:11:13.2740984Z GLIBC_2.14 2025-05-07T20:11:13.2741182Z 2025-05-07T20:11:13.2741348Z 2025-05-07T20:11:13.2741774Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.2742790Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.2743385Z 2025-05-07T20:11:13.2821243Z GLIBCXX_3.4 2025-05-07T20:11:13.2821888Z GLIBCXX_3.4.9 2025-05-07T20:11:13.2822489Z GLIBCXX_3.4.11 2025-05-07T20:11:13.2823090Z GLIBCXX_3.4.20 2025-05-07T20:11:13.2823669Z GLIBCXX_3.4.21 2025-05-07T20:11:13.2824022Z 2025-05-07T20:11:13.2824035Z 2025-05-07T20:11:13.2847569Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.B7ywzsim1f.symbols.txt 
2025-05-07T20:11:13.2848071Z 2025-05-07T20:11:13.2897078Z 2025-05-07T20:11:13.2926214Z [CHECK] Total Number of symbols: 783 2025-05-07T20:11:13.2941012Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:11:13.2957274Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.rgcmPI1xGk.usymbols.txt 2025-05-07T20:11:13.2957770Z 2025-05-07T20:11:13.2977887Z 2025-05-07T20:11:13.3001737Z [CHECK] Listing out undefined symbols (147 total): 2025-05-07T20:11:13.3021322Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.3022007Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.3022579Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.3023011Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.3023449Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.3023846Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.3024353Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.3024727Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.3025191Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.3025580Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.3025911Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.3026274Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.3026604Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.3026963Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.3027302Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.3027652Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.3027977Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.3028364Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:13.3028866Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:13.3029634Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3030821Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, 
std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3032188Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3033138Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:13.3034079Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3035068Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:13.3035853Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:13.3036758Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3037870Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3038667Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:13.3039112Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:13.3039470Z U c10::BoolType::get() 2025-05-07T20:11:13.3039819Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.3040221Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:13.3040634Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3041050Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.3041417Z U c10::IntType::get() 2025-05-07T20:11:13.3041819Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.3042319Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.3042778Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.3043408Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.3044091Z U c10::SymInt::guard_int(char const*, long) const 
2025-05-07T20:11:13.3044449Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.3044842Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.3045249Z U c10::TensorType::get() 2025-05-07T20:11:13.3045569Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.3046491Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.3047391Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.3047767Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.3048134Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.3048451Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.3048812Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.3049123Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.3049572Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.3050010Z U c10::cuda::current_device() 2025-05-07T20:11:13.3050311Z U c10::cuda::device_count() 2025-05-07T20:11:13.3050646Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.3051001Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.3051374Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.3051735Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.3052123Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.3052494Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.3053183Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.3054013Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.3054806Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.3055695Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned 
int, char const*, char const*) 2025-05-07T20:11:13.3056663Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.3057414Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.3057739Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.3058090Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:13.3058489Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:13.3058871Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.3059211Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.3059581Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.3059920Z U c10::throwNullDataPtrError() 2025-05-07T20:11:13.3060292Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.3060802Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:13.3061209Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.3061641Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.3062017Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:13.3062386Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.3062754Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.3063158Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.3063517Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.3063850Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.3064195Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.3064544Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.3064915Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:13.3065274Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:13.3065655Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:13.3066038Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.3066391Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.3066798Z U 
cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.3067143Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:13.3067518Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:13.3068039Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:13.3068596Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:13.3068970Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:13.3069315Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.3069704Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.3070077Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.3070508Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3070928Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.3071351Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3071740Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:13.3072122Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.3072580Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.3072984Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3073374Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.3073676Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.3073996Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.3074334Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.3074694Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.3075302Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.3076345Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.3077074Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.3077481Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.3077889Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.3078351Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.3078786Z U 
std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.3079299Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.3080327Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3081153Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.3081608Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.3081971Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.3082379Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.3082820Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3083367Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3083866Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.3084193Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.3084542Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.3085401Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.3086611Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.3087475Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.3088246Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.3088945Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3089669Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.3090083Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.3090519Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.3091111Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.3091753Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.3092204Z 
w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.3092506Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.3092801Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.3093076Z w __gmon_start__ 2025-05-07T20:11:13.3093341Z w __pthread_key_create 2025-05-07T20:11:13.3093630Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.3093931Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.3094277Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.3094686Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.3094982Z 2025-05-07T20:11:13.3095099Z linux-vdso.so.1 (0x00007ffe6e3bb000) 2025-05-07T20:11:13.3095382Z libtorch.so => not found 2025-05-07T20:11:13.3095627Z libc10.so => not found 2025-05-07T20:11:13.3095863Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.3096106Z libc10_cuda.so => not found 2025-05-07T20:11:13.3096357Z libnccl.so.2 => not found 2025-05-07T20:11:13.3096589Z libcuda.so.1 => not found 2025-05-07T20:11:13.3096843Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.3097092Z libtorch_cpu.so => not found 2025-05-07T20:11:13.3097352Z libtorch_cuda.so => not found 2025-05-07T20:11:13.3097765Z libcudart.so.12 => not found 2025-05-07T20:11:13.3098099Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbd6fb9c000) 2025-05-07T20:11:13.3098499Z libm.so.6 => /lib64/libm.so.6 (0x00007fbd6fac1000) 2025-05-07T20:11:13.3098871Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fbd71519000) 2025-05-07T20:11:13.3099470Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbd714eb000) 2025-05-07T20:11:13.3099853Z libc.so.6 => /lib64/libc.so.6 (0x00007fbd6f8b9000) 2025-05-07T20:11:13.3100363Z /lib64/ld-linux-x86-64.so.2 (0x00007fbd71577000) 2025-05-07T20:11:13.3100633Z 2025-05-07T20:11:13.3100740Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.3101174Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:11:13.3101562Z 2025-05-07T20:11:13.3101573Z 2025-05-07T20:11:13.3101785Z 
Dynamic section at offset 0x14b76f0 contains 39 entries: 2025-05-07T20:11:13.3102163Z Tag Type Name/Value 2025-05-07T20:11:13.3102592Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.3103086Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.3103619Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.3104144Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.3104642Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.3105150Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.3105691Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.3106217Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.3106741Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.3107252Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.3107771Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.3108269Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:13.3108772Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:13.3109271Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.3109777Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.3110301Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:13.3110752Z 0x000000000000000c (INIT) 0x2d000 2025-05-07T20:11:13.3111091Z 0x000000000000000d (FINI) 0xd6d2c 2025-05-07T20:11:13.3111425Z 0x0000000000000019 (INIT_ARRAY) 0x14b5318 2025-05-07T20:11:13.3111788Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:11:13.3112133Z 0x000000000000001a (FINI_ARRAY) 0x14b53e8 2025-05-07T20:11:13.3112487Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.3112824Z 
0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.3113163Z 0x0000000000000005 (STRTAB) 0x5fa8 2025-05-07T20:11:13.3113504Z 0x0000000000000006 (SYMTAB) 0x1628 2025-05-07T20:11:13.3113862Z 0x000000000000000a (STRSZ) 113301 (bytes) 2025-05-07T20:11:13.3114239Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.3114585Z 0x0000000000000003 (PLTGOT) 0x14b7fe8 2025-05-07T20:11:13.3114963Z 0x0000000000000002 (PLTRELSZ) 10368 (bytes) 2025-05-07T20:11:13.3115313Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.3115654Z 0x0000000000000017 (JMPREL) 0x29e58 2025-05-07T20:11:13.3115998Z 0x0000000000000007 (RELA) 0x22160 2025-05-07T20:11:13.3116346Z 0x0000000000000008 (RELASZ) 31992 (bytes) 2025-05-07T20:11:13.3116715Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.3117057Z 0x000000006ffffffe (VERNEED) 0x22060 2025-05-07T20:11:13.3117399Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.3117516Z 0x000000006ffffff0 (VERSYM) 0x21a3e 2025-05-07T20:11:13.3117628Z 0x000000006ffffff9 (RELACOUNT) 498 2025-05-07T20:11:13.3117775Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.3117781Z 2025-05-07T20:11:13.3117899Z ################################################################################ 2025-05-07T20:11:13.3117904Z 2025-05-07T20:11:13.3117908Z 2025-05-07T20:11:13.3118052Z ################################################################################ 2025-05-07T20:11:13.3118344Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.3118449Z [CHECK] Listing out library size: 2025-05-07T20:11:13.3118740Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.3118746Z 2025-05-07T20:11:13.3118969Z 9 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.3118973Z 2025-05-07T20:11:13.3119366Z [CHECK] Listing out the GLIBC versions referenced by: 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.3119864Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.3119869Z 2025-05-07T20:11:13.3175351Z GLIBC_2.2.5 2025-05-07T20:11:13.3175460Z GLIBC_2.3 2025-05-07T20:11:13.3175661Z GLIBC_2.14 2025-05-07T20:11:13.3175668Z 2025-05-07T20:11:13.3175673Z 2025-05-07T20:11:13.3176788Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.3177331Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.3177336Z 2025-05-07T20:11:13.3241705Z GLIBCXX_3.4 2025-05-07T20:11:13.3242819Z GLIBCXX_3.4.9 2025-05-07T20:11:13.3243107Z GLIBCXX_3.4.11 2025-05-07T20:11:13.3243365Z GLIBCXX_3.4.18 2025-05-07T20:11:13.3243590Z GLIBCXX_3.4.21 2025-05-07T20:11:13.3244407Z 2025-05-07T20:11:13.3244421Z 2025-05-07T20:11:13.3264691Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.78xa2pqODb.symbols.txt 2025-05-07T20:11:13.3264728Z 2025-05-07T20:11:13.3296315Z 2025-05-07T20:11:13.3322537Z [CHECK] Total Number of symbols: 347 2025-05-07T20:11:13.3341737Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:11:13.3359565Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.d6QkgDWZtV.usymbols.txt 2025-05-07T20:11:13.3359981Z 2025-05-07T20:11:13.3377975Z 2025-05-07T20:11:13.3402949Z [CHECK] Listing out undefined symbols (124 total): 2025-05-07T20:11:13.3421969Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.3422334Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.3422449Z U _Unwind_Resume@GCC_3.0 
2025-05-07T20:11:13.3422631Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.3422813Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.3422992Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.3423135Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.3423295Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.3423421Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.3423564Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.3423697Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.3423814Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.3423925Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.3424035Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.3424173Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:13.3424286Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.3424565Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.3424734Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:13.3424926Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:13.3425190Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:13.3425380Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:13.3425489Z U c10::BoolType::get() 2025-05-07T20:11:13.3425710Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.3425840Z U c10::FloatType::get() 2025-05-07T20:11:13.3425968Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:13.3426153Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3426325Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.3426434Z U c10::IntType::get() 2025-05-07T20:11:13.3426612Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.3426739Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.3427079Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.3427232Z U 
c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:13.3427645Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.3427811Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.3427940Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.3428045Z U c10::TensorType::get() 2025-05-07T20:11:13.3428297Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.3428971Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.3429102Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.3429239Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.3429355Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.3429470Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.3429605Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.3429722Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.3429960Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.3430084Z U c10::cuda::device_count() 2025-05-07T20:11:13.3430218Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.3430353Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.3430514Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.3430646Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.3430798Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.3430920Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.3431401Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.3431642Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.3432120Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.3432473Z U 
c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.3433022Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.3433181Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.3433284Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.3433432Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.3433585Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.3433717Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.3433821Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.3434022Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.3434147Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.3434275Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.3434403Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.3434523Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.3434661Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.3434783Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.3434904Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.3435038Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:13.3435152Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.3435278Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.3435390Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.3435498Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.3435634Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.3435752Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.3435863Z U float at::Tensor::item() const 2025-05-07T20:11:13.3436027Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3436168Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3436310Z U long* 
at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3436404Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.3436518Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.3436613Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.3436723Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.3436854Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.3437172Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.3437541Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.3437863Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:13.3438214Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.3438330Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.3438457Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:13.3438593Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.3438723Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.3438861Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.3439109Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.3439649Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3439806Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.3439922Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.3440033Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.3440174Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.3440348Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3440570Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3440701Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.3440805Z U 
std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.3440899Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.3441025Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.3441572Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.3442068Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.3442322Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.3442660Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.3442804Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.3442971Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.3443124Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.3443430Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.3443660Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.3443767Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.3443870Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.3443981Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.3444067Z w __gmon_start__ 2025-05-07T20:11:13.3444159Z w __pthread_key_create 2025-05-07T20:11:13.3444278Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.3444383Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.3444523Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.3444727Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.3444747Z 2025-05-07T20:11:13.3467003Z linux-vdso.so.1 (0x00007ffe445e9000) 2025-05-07T20:11:13.3467485Z libtorch.so => not found 2025-05-07T20:11:13.3467785Z libc10.so => not found 2025-05-07T20:11:13.3468052Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.3468313Z 
libc10_cuda.so => not found 2025-05-07T20:11:13.3468573Z libnccl.so.2 => not found 2025-05-07T20:11:13.3468848Z libcuda.so.1 => not found 2025-05-07T20:11:13.3469254Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.3469353Z libtorch_cpu.so => not found 2025-05-07T20:11:13.3469470Z libtorch_cuda.so => not found 2025-05-07T20:11:13.3469564Z libcudart.so.12 => not found 2025-05-07T20:11:13.3469741Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f7ca1b9c000) 2025-05-07T20:11:13.3469898Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f7ca287f000) 2025-05-07T20:11:13.3470161Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f7ca2851000) 2025-05-07T20:11:13.3470292Z libc.so.6 => /lib64/libc.so.6 (0x00007f7ca1994000) 2025-05-07T20:11:13.3470422Z /lib64/ld-linux-x86-64.so.2 (0x00007f7ca28dd000) 2025-05-07T20:11:13.3470562Z libm.so.6 => /lib64/libm.so.6 (0x00007f7ca2776000) 2025-05-07T20:11:13.3470633Z 2025-05-07T20:11:13.3471290Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.3471622Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:11:13.3471630Z 2025-05-07T20:11:13.3502088Z 2025-05-07T20:11:13.3502580Z Dynamic section at offset 0x8a7a10 contains 39 entries: 2025-05-07T20:11:13.3502740Z Tag Type Name/Value 2025-05-07T20:11:13.3502969Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.3503189Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.3503431Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.3503648Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.3503852Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.3504077Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.3504378Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.3504594Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 
2025-05-07T20:11:13.3504838Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.3505051Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.3505278Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.3505503Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:13.3505705Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.3505903Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.3506150Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.3506396Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:11:13.3506530Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:11:13.3506678Z 0x000000000000000d (FINI) 0x333cc 2025-05-07T20:11:13.3506808Z 0x0000000000000019 (INIT_ARRAY) 0x8a71f8 2025-05-07T20:11:13.3506941Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:11:13.3507064Z 0x000000000000001a (FINI_ARRAY) 0x8a7228 2025-05-07T20:11:13.3507214Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.3507337Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:13.3507455Z 0x0000000000000005 (STRTAB) 0x2a78 2025-05-07T20:11:13.3507591Z 0x0000000000000006 (SYMTAB) 0x9d8 2025-05-07T20:11:13.3507734Z 0x000000000000000a (STRSZ) 38406 (bytes) 2025-05-07T20:11:13.3507865Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.3508014Z 0x0000000000000003 (PLTGOT) 0x8a7fe8 2025-05-07T20:11:13.3508167Z 0x0000000000000002 (PLTRELSZ) 4728 (bytes) 2025-05-07T20:11:13.3508284Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.3508401Z 0x0000000000000017 (JMPREL) 0xe230 2025-05-07T20:11:13.3508538Z 0x0000000000000007 (RELA) 0xc448 2025-05-07T20:11:13.3508676Z 0x0000000000000008 (RELASZ) 7656 (bytes) 2025-05-07T20:11:13.3508802Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.3508948Z 0x000000006ffffffe 
(VERNEED) 0xc338 2025-05-07T20:11:13.3509066Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.3509187Z 0x000000006ffffff0 (VERSYM) 0xc07e 2025-05-07T20:11:13.3509308Z 0x000000006ffffff9 (RELACOUNT) 136 2025-05-07T20:11:13.3509502Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.3509508Z 2025-05-07T20:11:13.3509637Z ################################################################################ 2025-05-07T20:11:13.3509644Z 2025-05-07T20:11:13.3509698Z 2025-05-07T20:11:13.3509850Z ################################################################################ 2025-05-07T20:11:13.3510123Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.3510263Z [CHECK] Listing out library size: 2025-05-07T20:11:13.3510526Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.3510530Z 2025-05-07T20:11:13.3515247Z 17 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.3515649Z 2025-05-07T20:11:13.3516816Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.3517328Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.3517334Z 2025-05-07T20:11:13.3575701Z GLIBC_2.2.5 2025-05-07T20:11:13.3576055Z GLIBC_2.14 2025-05-07T20:11:13.3576271Z 2025-05-07T20:11:13.3576392Z 2025-05-07T20:11:13.3577003Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.3577526Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.3577531Z 2025-05-07T20:11:13.3640355Z GLIBCXX_3.4 2025-05-07T20:11:13.3641364Z GLIBCXX_3.4.9 2025-05-07T20:11:13.3641651Z 
GLIBCXX_3.4.20 2025-05-07T20:11:13.3641930Z GLIBCXX_3.4.21 2025-05-07T20:11:13.3642871Z 2025-05-07T20:11:13.3642884Z 2025-05-07T20:11:13.3667967Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.8QOyCP36Ue.symbols.txt 2025-05-07T20:11:13.3668006Z 2025-05-07T20:11:13.3697904Z 2025-05-07T20:11:13.3722782Z [CHECK] Total Number of symbols: 452 2025-05-07T20:11:13.3737694Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:11:13.3751785Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.X0Ye01Z50p.usymbols.txt 2025-05-07T20:11:13.3753228Z 2025-05-07T20:11:13.3771300Z 2025-05-07T20:11:13.3797326Z [CHECK] Listing out undefined symbols (149 total): 2025-05-07T20:11:13.3817326Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.3818665Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.3819023Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.3819424Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.3819830Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.3820325Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.3820733Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.3821125Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.3821580Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.3821977Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.3822318Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.3822659Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.3822990Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.3823337Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.3823688Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.3824021Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.3824361Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.3824674Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.3825130Z U __gxx_personality_v0@CXXABI_1.3 
2025-05-07T20:11:13.3825617Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:13.3826156Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:13.3827090Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3828474Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3829433Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:13.3829919Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:13.3830400Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:13.3830926Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:13.3831440Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:13.3832154Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3833373Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.3834131Z U c10::BoolType::get() 2025-05-07T20:11:13.3834477Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.3834895Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.3835409Z U c10::IntType::get() 2025-05-07T20:11:13.3835792Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.3836196Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.3836670Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.3837190Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.3837608Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.3838289Z U 
c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.3838938Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.3839328Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.3839685Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:13.3840009Z U c10::SymIntType::get() 2025-05-07T20:11:13.3840386Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:13.3840806Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.3841195Z U c10::TensorType::get() 2025-05-07T20:11:13.3841527Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.3842489Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.3843456Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.3843820Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.3844362Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.3844736Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.3845126Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.3845503Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.3845981Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.3846517Z U c10::cuda::current_device() 2025-05-07T20:11:13.3846839Z U c10::cuda::device_count() 2025-05-07T20:11:13.3847219Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.3847658Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.3848056Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.3848486Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.3848899Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.3849312Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.3850082Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.3850976Z U 
c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.3851906Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.3852998Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.3854022Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.3854846Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.3855188Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.3855585Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:13.3856032Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:13.3856438Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.3856833Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.3857217Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.3857603Z U c10::throwNullDataPtrError() 2025-05-07T20:11:13.3858110Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.3858442Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:13.3858891Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.3859320Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.3859709Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:13.3860158Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.3860563Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.3860960Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.3861325Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.3861707Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.3862096Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.3862481Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.3862848Z U 
cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:13.3863225Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:13.3863576Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:13.3863962Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:13.3864366Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.3864759Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.3865131Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.3865485Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:13.3865899Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:13.3866419Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:13.3866974Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:13.3867381Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:13.3867725Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.3868116Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.3868480Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.3868866Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.3869260Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3869645Z U log2@GLIBC_2.2.5 2025-05-07T20:11:13.3870040Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.3870472Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.3870925Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.3871295Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.3871606Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.3871907Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.3872243Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.3872598Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.3873208Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.3874083Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.3874715Z U std::__throw_bad_alloc()@GLIBCXX_3.4 
2025-05-07T20:11:13.3875108Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.3875546Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.3876162Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.3876792Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.3877767Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3878616Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:13.3879004Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.3879365Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.3879741Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.3880086Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.3880523Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3881089Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.3881586Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.3881960Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.3882286Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.3882631Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.3883542Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.3884719Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.3885574Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.3886368Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.3887054Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:13.3887477Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.3887916Z U vtable for 
__cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.3888377Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.3889102Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.3889793Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.3890280Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.3890656Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.3890994Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.3891301Z w __gmon_start__ 2025-05-07T20:11:13.3891604Z w __pthread_key_create 2025-05-07T20:11:13.3891956Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.3892432Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.3892743Z 2025-05-07T20:11:13.3892893Z linux-vdso.so.1 (0x00007ffcc4fa9000) 2025-05-07T20:11:13.3893368Z libtorch.so => not found 2025-05-07T20:11:13.3893699Z libc10.so => not found 2025-05-07T20:11:13.3893983Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.3894260Z libc10_cuda.so => not found 2025-05-07T20:11:13.3894563Z libnccl.so.2 => not found 2025-05-07T20:11:13.3894831Z libcuda.so.1 => not found 2025-05-07T20:11:13.3895108Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.3895398Z libtorch_cpu.so => not found 2025-05-07T20:11:13.3895707Z libtorch_cuda.so => not found 2025-05-07T20:11:13.3895991Z libcudart.so.12 => not found 2025-05-07T20:11:13.3896345Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fd205b9c000) 2025-05-07T20:11:13.3896746Z libm.so.6 => /lib64/libm.so.6 (0x00007fd206f78000) 2025-05-07T20:11:13.3897166Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fd206f22000) 2025-05-07T20:11:13.3897599Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fd206ef4000) 2025-05-07T20:11:13.3897982Z libc.so.6 => /lib64/libc.so.6 (0x00007fd205994000) 2025-05-07T20:11:13.3898356Z /lib64/ld-linux-x86-64.so.2 (0x00007fd20705b000) 2025-05-07T20:11:13.3898594Z 2025-05-07T20:11:13.3898709Z [CHECK] 
Displaying ELF information: 2025-05-07T20:11:13.3899153Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:11:13.3899492Z 2025-05-07T20:11:13.3899512Z 2025-05-07T20:11:13.3899694Z Dynamic section at offset 0x104fa28 contains 39 entries: 2025-05-07T20:11:13.3900144Z Tag Type Name/Value 2025-05-07T20:11:13.3900593Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.3901162Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.3901694Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.3902239Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.3902751Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.3903284Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.3903809Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.3904403Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.3904934Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.3905480Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.3906060Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.3906575Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:13.3907126Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:13.3907643Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.3908168Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.3908697Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:13.3909172Z 0x000000000000000c (INIT) 0x11000 2025-05-07T20:11:13.3909539Z 0x000000000000000d (FINI) 0x8746c 2025-05-07T20:11:13.3909889Z 0x0000000000000019 (INIT_ARRAY) 0x104ff20 2025-05-07T20:11:13.3910274Z 
0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:11:13.3910632Z 0x000000000000001a (FINI_ARRAY) 0x104ff80 2025-05-07T20:11:13.3911031Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.3911384Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.3911744Z 0x0000000000000005 (STRTAB) 0x3660 2025-05-07T20:11:13.3912082Z 0x0000000000000006 (SYMTAB) 0xbe8 2025-05-07T20:11:13.3912456Z 0x000000000000000a (STRSZ) 35789 (bytes) 2025-05-07T20:11:13.3912847Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.3913205Z 0x0000000000000003 (PLTGOT) 0x1050fe8 2025-05-07T20:11:13.3913601Z 0x0000000000000002 (PLTRELSZ) 6480 (bytes) 2025-05-07T20:11:13.3913957Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.3914310Z 0x0000000000000017 (JMPREL) 0xf060 2025-05-07T20:11:13.3914647Z 0x0000000000000007 (RELA) 0xc6a8 2025-05-07T20:11:13.3915026Z 0x0000000000000008 (RELASZ) 10680 (bytes) 2025-05-07T20:11:13.3915414Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.3915775Z 0x000000006ffffffe (VERNEED) 0xc5b8 2025-05-07T20:11:13.3916130Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:13.3916463Z 0x000000006ffffff0 (VERSYM) 0xc22e 2025-05-07T20:11:13.3916813Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:11:13.3917124Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.3917356Z 2025-05-07T20:11:13.3917480Z ################################################################################ 2025-05-07T20:11:13.3917714Z 2025-05-07T20:11:13.3917718Z 2025-05-07T20:11:13.3917854Z ################################################################################ 2025-05-07T20:11:13.3918394Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.3918936Z [CHECK] Listing out library size: 2025-05-07T20:11:13.3919418Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.3919841Z 2025-05-07T20:11:13.3920076Z 2 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.3920411Z 2025-05-07T20:11:13.3920851Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.3921895Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.3922538Z 2025-05-07T20:11:13.3962044Z GLIBC_2.2.5 2025-05-07T20:11:13.3962347Z GLIBC_2.14 2025-05-07T20:11:13.3962507Z 2025-05-07T20:11:13.3963111Z 2025-05-07T20:11:13.3965083Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.3966259Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.3966936Z 2025-05-07T20:11:13.4016376Z GLIBCXX_3.4 2025-05-07T20:11:13.4016683Z GLIBCXX_3.4.9 2025-05-07T20:11:13.4016906Z GLIBCXX_3.4.21 2025-05-07T20:11:13.4017040Z 2025-05-07T20:11:13.4017044Z 2025-05-07T20:11:13.4032782Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.RfYGLDylM7.symbols.txt 2025-05-07T20:11:13.4034260Z 2025-05-07T20:11:13.4060434Z 2025-05-07T20:11:13.4083728Z [CHECK] Total Number of symbols: 277 2025-05-07T20:11:13.4098879Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:11:13.4111477Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.3iZqsAwDBD.usymbols.txt 2025-05-07T20:11:13.4112030Z 2025-05-07T20:11:13.4131282Z 2025-05-07T20:11:13.4152770Z [CHECK] Listing out undefined symbols (127 total): 2025-05-07T20:11:13.4172376Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.4174477Z U _Unwind_Resume@GCC_3.0 
2025-05-07T20:11:13.4175494Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.4176994Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.4178103Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.4179164Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.4179757Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.4180187Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.4180545Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.4180894Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.4181219Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.4181536Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.4181846Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.4182176Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.4182497Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.4182904Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:13.4183798Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.4185116Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.4186337Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:13.4186717Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:13.4187364Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.4188000Z U at::get_thread_num() 2025-05-07T20:11:13.4188295Z U at::internal::set_thread_num(int) 2025-05-07T20:11:13.4189024Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.4189909Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:13.4190444Z U c10::BFloat16* at::TensorBase::data_ptr() 
const 2025-05-07T20:11:13.4190927Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.4191326Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.4191688Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.4192102Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:13.4192444Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:13.4192797Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.4193219Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.4193549Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.4193901Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:13.4194277Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.4194621Z U c10::TensorType::get() 2025-05-07T20:11:13.4194932Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.4195805Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.4196734Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.4197070Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.4197386Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.4197704Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.4198008Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.4198328Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.4198755Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.4199191Z U c10::cuda::device_count() 2025-05-07T20:11:13.4199514Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.4199857Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.4200225Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:13.4200577Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.4200957Z U 
c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.4201317Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.4201992Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.4202810Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.4203603Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.4204482Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.4205011Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.4205311Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.4205650Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:13.4206031Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:13.4206402Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.4206744Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.4207094Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.4207429Z U c10::throwNullDataPtrError() 2025-05-07T20:11:13.4207720Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.4208053Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:13.4208437Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.4208826Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.4209185Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:13.4209515Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.4209861Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.4210209Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.4210540Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.4210865Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.4211164Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.4211484Z U 
cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.4211809Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:13.4212126Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:13.4212431Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:13.4212929Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.4213325Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.4213653Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.4214142Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:13.4214638Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:13.4214970Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:13.4215458Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.4215804Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.4216164Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.4216537Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.4216915Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.4217321Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.4217749Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.4218081Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.4218368Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.4218649Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.4218962Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.4219304Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.4219873Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.4220789Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.4221436Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.4221798Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.4222193Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.4222692Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 
2025-05-07T20:11:13.4223618Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.4224428Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.4224768Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.4225117Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.4225462Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.4225908Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.4226424Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.4226705Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.4227031Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.4227795Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.4228890Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.4229658Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.4230340Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.4230910Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.4231299Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.4231719Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.4232276Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.4232892Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.4233303Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.4233600Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.4233879Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.4234157Z w __gmon_start__ 2025-05-07T20:11:13.4234399Z w __pthread_key_create 2025-05-07T20:11:13.4234720Z [CHECK] Listing out external shared 
libraries linked: 2025-05-07T20:11:13.4235168Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.4235488Z 2025-05-07T20:11:13.4235606Z linux-vdso.so.1 (0x00007ffffdbaa000) 2025-05-07T20:11:13.4235878Z libc10.so => not found 2025-05-07T20:11:13.4236101Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.4236348Z libc10_cuda.so => not found 2025-05-07T20:11:13.4236581Z libnccl.so.2 => not found 2025-05-07T20:11:13.4236822Z libcuda.so.1 => not found 2025-05-07T20:11:13.4237400Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f98dda00000) 2025-05-07T20:11:13.4238008Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.4238261Z libtorch.so => not found 2025-05-07T20:11:13.4238485Z libtorch_cpu.so => not found 2025-05-07T20:11:13.4238738Z libtorch_cuda.so => not found 2025-05-07T20:11:13.4238977Z libcudart.so.12 => not found 2025-05-07T20:11:13.4239290Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f98dd79c000) 2025-05-07T20:11:13.4239671Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f98deb4c000) 2025-05-07T20:11:13.4240018Z libc.so.6 => /lib64/libc.so.6 (0x00007f98dd594000) 2025-05-07T20:11:13.4240321Z libtorch.so => not found 2025-05-07T20:11:13.4240544Z libc10.so => not found 2025-05-07T20:11:13.4240775Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.4241013Z libc10_cuda.so => not found 2025-05-07T20:11:13.4241236Z libnccl.so.2 => not found 2025-05-07T20:11:13.4241462Z libcuda.so.1 => not found 2025-05-07T20:11:13.4241693Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.4241943Z libtorch_cpu.so => not found 2025-05-07T20:11:13.4242177Z libtorch_cuda.so => not found 2025-05-07T20:11:13.4242417Z libcudart.so.12 => not found 2025-05-07T20:11:13.4242675Z libm.so.6 => /lib64/libm.so.6 (0x00007f98dd4b9000) 2025-05-07T20:11:13.4243024Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f98deaf2000) 2025-05-07T20:11:13.4243365Z 
/lib64/ld-linux-x86-64.so.2 (0x00007f98ded29000) 2025-05-07T20:11:13.4243617Z 2025-05-07T20:11:13.4243713Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.4244128Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:11:13.4244492Z 2025-05-07T20:11:13.4248271Z 2025-05-07T20:11:13.4248425Z Dynamic section at offset 0x16eba8 contains 39 entries: 2025-05-07T20:11:13.4248787Z Tag Type Name/Value 2025-05-07T20:11:13.4249741Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.4252445Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.4253185Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.4253727Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.4254240Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.4254802Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:13.4255348Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.4255884Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.4256485Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.4257006Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.4257550Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.4258066Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.4258592Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.4259088Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.4259652Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:13.4260316Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.4260726Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:11:13.4261076Z 
0x000000000000000d (FINI) 0x1a14c 2025-05-07T20:11:13.4261419Z 0x0000000000000019 (INIT_ARRAY) 0x16f890 2025-05-07T20:11:13.4261789Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:11:13.4262142Z 0x000000000000001a (FINI_ARRAY) 0x16f8b0 2025-05-07T20:11:13.4262507Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.4262883Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.4263221Z 0x0000000000000005 (STRTAB) 0x2108 2025-05-07T20:11:13.4263568Z 0x0000000000000006 (SYMTAB) 0x6f8 2025-05-07T20:11:13.4263913Z 0x000000000000000a (STRSZ) 20443 (bytes) 2025-05-07T20:11:13.4264295Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.4264643Z 0x0000000000000003 (PLTGOT) 0x16ffe8 2025-05-07T20:11:13.4265018Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:11:13.4265365Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.4265701Z 0x0000000000000017 (JMPREL) 0x8150 2025-05-07T20:11:13.4266045Z 0x0000000000000007 (RELA) 0x73d0 2025-05-07T20:11:13.4266401Z 0x0000000000000008 (RELASZ) 3456 (bytes) 2025-05-07T20:11:13.4266769Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.4267111Z 0x000000006ffffffe (VERNEED) 0x7310 2025-05-07T20:11:13.4267464Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:13.4267785Z 0x000000006ffffff0 (VERSYM) 0x70e4 2025-05-07T20:11:13.4268140Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:11:13.4268450Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.4268680Z 2025-05-07T20:11:13.4268805Z ################################################################################ 2025-05-07T20:11:13.4269039Z 2025-05-07T20:11:13.4269043Z 2025-05-07T20:11:13.4269667Z ################################################################################ 2025-05-07T20:11:13.4270229Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.4270832Z [CHECK] Listing out library size: 2025-05-07T20:11:13.4271341Z + du -h --block-size=1M 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.4271778Z 2025-05-07T20:11:13.4272072Z 11 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.4272434Z 2025-05-07T20:11:13.4272898Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.4273971Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.4274628Z 2025-05-07T20:11:13.4723524Z GLIBC_2.2.5 2025-05-07T20:11:13.4723847Z GLIBC_2.3 2025-05-07T20:11:13.4725138Z GLIBC_2.14 2025-05-07T20:11:13.4725561Z 2025-05-07T20:11:13.4725576Z 2025-05-07T20:11:13.4726992Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.4729891Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.4730541Z 2025-05-07T20:11:13.5191526Z GLIBCXX_3.4 2025-05-07T20:11:13.5191778Z GLIBCXX_3.4.9 2025-05-07T20:11:13.5192024Z GLIBCXX_3.4.11 2025-05-07T20:11:13.5192237Z GLIBCXX_3.4.15 2025-05-07T20:11:13.5192463Z GLIBCXX_3.4.18 2025-05-07T20:11:13.5192667Z GLIBCXX_3.4.20 2025-05-07T20:11:13.5192882Z GLIBCXX_3.4.21 2025-05-07T20:11:13.5196276Z 2025-05-07T20:11:13.5196289Z 2025-05-07T20:11:13.5217645Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.XtpK92mdKq.symbols.txt 2025-05-07T20:11:13.5218194Z 2025-05-07T20:11:13.5624327Z 2025-05-07T20:11:13.5653629Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:11:13.5694640Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:11:13.5708242Z + nm -gDCu 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.G5taqMuom7.usymbols.txt 2025-05-07T20:11:13.5708775Z 2025-05-07T20:11:13.5742037Z 2025-05-07T20:11:13.5774268Z [CHECK] Listing out undefined symbols (185 total): 2025-05-07T20:11:13.5794502Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.5795356Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.5796009Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.5796376Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.5796726Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.5797054Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.5797399Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.5797734Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:13.5798105Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.5798432Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.5798783Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.5799090Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.5799396Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.5799704Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.5800028Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.5800403Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:13.5800985Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:13.5801468Z U at::RecordFunction::end() 2025-05-07T20:11:13.5801841Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:13.5802355Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:13.5803083Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:13.5803756Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:13.5804514Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:13.5805150Z U 
at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:13.5806102Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.5807089Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:13.5807574Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:13.5808025Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:13.5808383Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:13.5808764Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:13.5809077Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:13.5809364Z U c10::AnyType::get() 2025-05-07T20:11:13.5809648Z U c10::BoolType::get() 2025-05-07T20:11:13.5809986Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.5810444Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:13.5810835Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:13.5811561Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:13.5812771Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:13.5813824Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.5814422Z U c10::Error::what() const 2025-05-07T20:11:13.5814732Z U c10::FloatType::get() 2025-05-07T20:11:13.5815055Z U c10::GradMode::is_enabled() 2025-05-07T20:11:13.5815170Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:13.5815349Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:13.5815471Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:13.5815585Z U c10::IValue::isBoolList() const 2025-05-07T20:11:13.5815725Z U c10::IValue::isDoubleList() const 2025-05-07T20:11:13.5815841Z U 
c10::IValue::isIntList() const 2025-05-07T20:11:13.5815955Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:13.5816070Z U c10::IValue::isTensorList() const 2025-05-07T20:11:13.5816234Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.5816330Z U c10::IntType::get() 2025-05-07T20:11:13.5816777Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.5816959Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.5817076Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.5817223Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:13.5817353Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:13.5817558Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.5817843Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:13.5817954Z U c10::StringType::get() 2025-05-07T20:11:13.5818115Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:13.5818251Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.5818399Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:13.5818542Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:13.5818921Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.5819065Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.5819187Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:13.5819365Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:13.5819492Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.5819616Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:13.5819722Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:13.5819832Z U c10::SymIntType::get() 2025-05-07T20:11:13.5819951Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:13.5820046Z U c10::TensorType::get() 
2025-05-07T20:11:13.5820268Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.5820849Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.5821370Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.5821699Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.5822197Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.5822536Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.5823133Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.5823459Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:13.5823659Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:13.5823782Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:13.5823943Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:13.5824315Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.5824454Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:13.5824616Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:13.5824767Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:13.5824924Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:13.5825154Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.5825277Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.5825550Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:13.5825864Z U 
fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:13.5825963Z U free@GLIBC_2.2.5 2025-05-07T20:11:13.5826178Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.5826277Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:13.5826373Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.5826485Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.5826579Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.5826699Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.5826836Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.5826935Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:13.5827149Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:13.5827490Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.5827924Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.5828256Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:13.5828650Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.5829019Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.5829140Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.5829271Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:13.5829414Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.5829561Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.5829750Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.5829884Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.5830030Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:13.5830287Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.5830870Z U 
std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.5831004Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:13.5831140Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.5831264Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.5831389Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.5831519Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.5831705Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.5831949Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.5832094Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.5832261Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.5832397Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:13.5832966Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:13.5833099Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:13.5833204Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.5833327Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:13.5833430Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.5833545Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.5834118Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.5834558Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.5834801Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.5834930Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:13.5835205Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:13.5835404Z U 
torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:13.5835605Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:13.5835780Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:13.5836104Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:13.5836257Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:13.5836431Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:13.5836597Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:13.5836724Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:13.5836832Z U torch::autograd::Node::metadata() 2025-05-07T20:11:13.5836964Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:13.5837204Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:13.5837456Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:13.5837586Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:13.5837796Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:13.5838000Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:13.5840481Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:13.5840631Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:13.5840773Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:13.5840965Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:13.5841700Z U 
torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:13.5841884Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:13.5842294Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:13.5842632Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.5842747Z U typeinfo for c10::Error 2025-05-07T20:11:13.5842882Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:13.5843001Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:13.5843143Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:13.5843269Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:13.5843381Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:13.5843552Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.5843717Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.5843866Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:13.5844011Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.5844118Z U vtable for c10::Error 2025-05-07T20:11:13.5844424Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.5844549Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:13.5844776Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.5844884Z U vtable for torch::autograd::Node 2025-05-07T20:11:13.5845049Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:13.5845170Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.5845271Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.5845370Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.5845473Z w __gmon_start__ 2025-05-07T20:11:13.5845564Z w __pthread_key_create 2025-05-07T20:11:13.5845670Z w pthread_mutex_lock@GLIBC_2.2.5 
2025-05-07T20:11:13.5845786Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.5845931Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.5846167Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.5846173Z 2025-05-07T20:11:13.5854915Z linux-vdso.so.1 (0x00007ffde5b6e000) 2025-05-07T20:11:13.5855186Z libc10.so => not found 2025-05-07T20:11:13.5855478Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5855857Z libc10_cuda.so => not found 2025-05-07T20:11:13.5856157Z libnccl.so.2 => not found 2025-05-07T20:11:13.5856478Z libcuda.so.1 => not found 2025-05-07T20:11:13.5857889Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f83aec00000) 2025-05-07T20:11:13.5859272Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f83ae800000) 2025-05-07T20:11:13.5860559Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f83ae659000) 2025-05-07T20:11:13.5860831Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5860959Z libtorch.so => not found 2025-05-07T20:11:13.5861445Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f83b0ebb000) 2025-05-07T20:11:13.5861593Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5861708Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5861902Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f83ae3f5000) 2025-05-07T20:11:13.5862053Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f83b0e8d000) 2025-05-07T20:11:13.5862193Z libc.so.6 => /lib64/libc.so.6 (0x00007f83ae1ed000) 2025-05-07T20:11:13.5862364Z /lib64/ld-linux-x86-64.so.2 (0x00007f83b0ece000) 2025-05-07T20:11:13.5862459Z libtorch.so => not found 2025-05-07T20:11:13.5862559Z libc10.so => not 
found 2025-05-07T20:11:13.5862657Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5862748Z libc10_cuda.so => not found 2025-05-07T20:11:13.5862840Z libnccl.so.2 => not found 2025-05-07T20:11:13.5862944Z libcuda.so.1 => not found 2025-05-07T20:11:13.5863040Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5863138Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5863237Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5863345Z libcudart.so.12 => not found 2025-05-07T20:11:13.5863468Z libm.so.6 => /lib64/libm.so.6 (0x00007f83b0325000) 2025-05-07T20:11:13.5863653Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f83b02cf000) 2025-05-07T20:11:13.5863755Z libc10.so => not found 2025-05-07T20:11:13.5863846Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5863937Z libc10_cuda.so => not found 2025-05-07T20:11:13.5864041Z libnccl.so.2 => not found 2025-05-07T20:11:13.5864134Z libcuda.so.1 => not found 2025-05-07T20:11:13.5864494Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f83adc00000) 2025-05-07T20:11:13.5864596Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5864702Z libtorch.so => not found 2025-05-07T20:11:13.5864794Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5864891Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5864999Z libcudart.so.12 => not found 2025-05-07T20:11:13.5865085Z libc10.so => not found 2025-05-07T20:11:13.5865179Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5865268Z libc10_cuda.so => not found 2025-05-07T20:11:13.5865369Z libnccl.so.2 => not found 2025-05-07T20:11:13.5865461Z libcuda.so.1 => not found 2025-05-07T20:11:13.5865918Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f83aca00000) 2025-05-07T20:11:13.5866025Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5866115Z libtorch.so => not found 2025-05-07T20:11:13.5866205Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5866298Z 
libtorch_cuda.so => not found 2025-05-07T20:11:13.5866401Z libcudart.so.12 => not found 2025-05-07T20:11:13.5866488Z libtorch.so => not found 2025-05-07T20:11:13.5866571Z libc10.so => not found 2025-05-07T20:11:13.5866675Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5866762Z libc10_cuda.so => not found 2025-05-07T20:11:13.5866851Z libnccl.so.2 => not found 2025-05-07T20:11:13.5866938Z libcuda.so.1 => not found 2025-05-07T20:11:13.5867044Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5867135Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5867230Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5867419Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f83b0e76000) 2025-05-07T20:11:13.5867506Z libc10.so => not found 2025-05-07T20:11:13.5867598Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5867687Z libc10_cuda.so => not found 2025-05-07T20:11:13.5867792Z libnccl.so.2 => not found 2025-05-07T20:11:13.5867882Z libcuda.so.1 => not found 2025-05-07T20:11:13.5868234Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f83aeb89000) 2025-05-07T20:11:13.5868346Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5868436Z libtorch.so => not found 2025-05-07T20:11:13.5868529Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5868649Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5868774Z libtorch.so => not found 2025-05-07T20:11:13.5868869Z libc10.so => not found 2025-05-07T20:11:13.5868969Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5869095Z libc10_cuda.so => not found 2025-05-07T20:11:13.5869198Z libnccl.so.2 => not found 2025-05-07T20:11:13.5869318Z libcuda.so.1 => not found 2025-05-07T20:11:13.5869441Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5869543Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5869648Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5869771Z libcudart.so.12 => not found 2025-05-07T20:11:13.5869894Z libtorch.so => not found 2025-05-07T20:11:13.5869992Z libc10.so => 
not found 2025-05-07T20:11:13.5870094Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.5870215Z libc10_cuda.so => not found 2025-05-07T20:11:13.5870316Z libnccl.so.2 => not found 2025-05-07T20:11:13.5870412Z libcuda.so.1 => not found 2025-05-07T20:11:13.5870512Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.5870641Z libtorch_cpu.so => not found 2025-05-07T20:11:13.5870750Z libtorch_cuda.so => not found 2025-05-07T20:11:13.5870887Z librt.so.1 => /lib64/librt.so.1 (0x00007f83b0e67000) 2025-05-07T20:11:13.5870901Z 2025-05-07T20:11:13.5871033Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.5871353Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:11:13.5871358Z 2025-05-07T20:11:13.5898842Z 2025-05-07T20:11:13.5899577Z Dynamic section at offset 0xa44058 contains 42 entries: 2025-05-07T20:11:13.5900010Z Tag Type Name/Value 2025-05-07T20:11:13.5900832Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.5901499Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.5902128Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.5902707Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.5903297Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.5903979Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:13.5904628Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:13.5905371Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:13.5906015Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.5906592Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.5907214Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:13.5907565Z 0x0000000000000001 (NEEDED) Shared library: 
[libtorch_cpu.so] 2025-05-07T20:11:13.5907760Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.5907958Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.5908146Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.5908353Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.5908559Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.5908819Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:11:13.5909015Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.5909135Z 0x000000000000000c (INIT) 0x190000 2025-05-07T20:11:13.5909251Z 0x000000000000000d (FINI) 0x8ac368 2025-05-07T20:11:13.5909382Z 0x0000000000000019 (INIT_ARRAY) 0xa37c40 2025-05-07T20:11:13.5909510Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:11:13.5909625Z 0x000000000000001a (FINI_ARRAY) 0xa37d40 2025-05-07T20:11:13.5909746Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.5909991Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.5910110Z 0x0000000000000005 (STRTAB) 0x23008 2025-05-07T20:11:13.5910216Z 0x0000000000000006 (SYMTAB) 0x93e8 2025-05-07T20:11:13.5910420Z 0x000000000000000a (STRSZ) 1248185 (bytes) 2025-05-07T20:11:13.5910540Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.5910658Z 0x0000000000000003 (PLTGOT) 0xa47fe8 2025-05-07T20:11:13.5911018Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:11:13.5911299Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.5911424Z 0x0000000000000017 (JMPREL) 0x184d90 2025-05-07T20:11:13.5911539Z 0x0000000000000007 (RELA) 0x155f30 2025-05-07T20:11:13.5911700Z 0x0000000000000008 (RELASZ) 192096 (bytes) 2025-05-07T20:11:13.5911828Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.5911954Z 0x000000006ffffffe (VERNEED) 0x155e20 2025-05-07T20:11:13.5912089Z 
0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:13.5912212Z 0x000000006ffffff0 (VERSYM) 0x153bc2 2025-05-07T20:11:13.5912327Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:11:13.5912491Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.5912496Z 2025-05-07T20:11:13.5912621Z ################################################################################ 2025-05-07T20:11:13.5912626Z 2025-05-07T20:11:13.5912630Z 2025-05-07T20:11:13.5912758Z ################################################################################ 2025-05-07T20:11:13.5913065Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:13.5913176Z [CHECK] Listing out library size: 2025-05-07T20:11:13.5913452Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:13.5913456Z 2025-05-07T20:11:13.5913702Z 429 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:13.5913706Z 2025-05-07T20:11:13.5914101Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:13.5914603Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.5914607Z 2025-05-07T20:11:13.6305960Z GLIBC_2.2.5 2025-05-07T20:11:13.6306205Z GLIBC_2.14 2025-05-07T20:11:13.6306237Z 2025-05-07T20:11:13.6306410Z 2025-05-07T20:11:13.6307275Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:13.6307817Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.6307826Z 2025-05-07T20:11:13.6699932Z GLIBCXX_3.4 2025-05-07T20:11:13.6700193Z GLIBCXX_3.4.9 2025-05-07T20:11:13.6700327Z GLIBCXX_3.4.11 
2025-05-07T20:11:13.6700413Z GLIBCXX_3.4.14 2025-05-07T20:11:13.6700566Z GLIBCXX_3.4.18 2025-05-07T20:11:13.6700651Z GLIBCXX_3.4.20 2025-05-07T20:11:13.6701540Z GLIBCXX_3.4.21 2025-05-07T20:11:13.6701604Z 2025-05-07T20:11:13.6701618Z 2025-05-07T20:11:13.6722115Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.gMuWSzIcbd.symbols.txt 2025-05-07T20:11:13.6722156Z 2025-05-07T20:11:13.7077620Z 2025-05-07T20:11:13.7116287Z [CHECK] Total Number of symbols: 5083 2025-05-07T20:11:13.7145984Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:11:13.7166329Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.9yF5Q3iS0s.usymbols.txt 2025-05-07T20:11:13.7166341Z 2025-05-07T20:11:13.7206955Z 2025-05-07T20:11:13.7232626Z [CHECK] Listing out undefined symbols (246 total): 2025-05-07T20:11:13.7253453Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.7254667Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.7254977Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.7255564Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.7255977Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:13.7256371Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.7256885Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:13.7257263Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:13.7257631Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:13.7258024Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:13.7258364Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.7258703Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.7259015Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.7259322Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.7259644Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:13.7259982Z U __cxa_guard_abort@CXXABI_1.3 
2025-05-07T20:11:13.7260187Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.7260307Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.7260448Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.7260724Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.7260848Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.7261083Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:13.7261654Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7262253Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7262941Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7263121Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:13.7263631Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7263823Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:13.7264146Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:13.7264811Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:13.7265303Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7265433Z U at::detail::getCUDAHooks() 2025-05-07T20:11:13.7265579Z U at::detail::getHIPHooks() 2025-05-07T20:11:13.7265691Z U at::get_thread_num() 2025-05-07T20:11:13.7265810Z U at::globalContext() 2025-05-07T20:11:13.7265969Z U at::internal::set_thread_num(int) 2025-05-07T20:11:13.7266157Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 
2025-05-07T20:11:13.7266389Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7266731Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7266911Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:11:13.7267269Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:11:13.7267485Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:13.7268107Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.7268467Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.7268614Z U c10::Error::what() const 2025-05-07T20:11:13.7268729Z U c10::GradMode::is_enabled() 2025-05-07T20:11:13.7268848Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:13.7269026Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7269254Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7269417Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:13.7269577Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:11:13.7269702Z U c10::IValue::isTensorList() const 2025-05-07T20:11:13.7269850Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:13.7269985Z U c10::IntType::get() 2025-05-07T20:11:13.7270444Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.7270619Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.7270777Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.7270888Z U c10::NoneType::get() 2025-05-07T20:11:13.7271109Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.7271276Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:13.7271414Z U c10::ParallelGuard::~ParallelGuard() 
2025-05-07T20:11:13.7271579Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:13.7271691Z U c10::StringType::get() 2025-05-07T20:11:13.7271862Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.7272014Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:13.7272409Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.7272583Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.7272703Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.7272859Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:13.7273291Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:13.7273396Z U c10::TensorType::get() 2025-05-07T20:11:13.7274129Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:13.7274279Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.7275020Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:13.7275280Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:11:13.7275418Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:13.7275541Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:13.7275722Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:13.7275847Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:13.7276122Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:13.7276401Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:13.7276686Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:13.7276807Z U c10::cuda::device_count() 2025-05-07T20:11:13.7277072Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:13.7277248Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:13.7277456Z U c10::cuda::getStreamFromPool(bool, signed char) 
2025-05-07T20:11:13.7277611Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:13.7277807Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:13.7277935Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:13.7278380Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.7278930Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.7279969Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.7280265Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.7280774Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.7281127Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.7281756Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.7281929Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:11:13.7282048Z U c10::get_default_dtype() 2025-05-07T20:11:13.7282363Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:13.7282579Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:13.7282812Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:13.7282953Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:13.7283081Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:13.7283439Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.7283626Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 
2025-05-07T20:11:13.7283808Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:11:13.7284046Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:11:13.7284217Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:13.7284396Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:13.7284562Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:11:13.7284700Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:13.7284916Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.7285038Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.7285201Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:13.7285338Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:13.7285463Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:13.7285620Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:13.7285739Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:13.7285860Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:13.7285987Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:13.7286165Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:13.7286289Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:13.7286408Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:13.7286550Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:13.7286680Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:13.7286799Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:13.7287539Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7288316Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7289127Z U fbgemm::EmbeddingSpMDMKernelSignature::Type 
fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7289850Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7290635Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7291453Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7292123Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:13.7292891Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:13.7293686Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7294514Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:13.7295393Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7296150Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:13.7296964Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 
2025-05-07T20:11:13.7297831Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7298539Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:13.7299301Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:13.7300186Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7301398Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:13.7302337Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7303157Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:11:13.7304092Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:11:13.7305023Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7305904Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7306851Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, 
bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7307793Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7308700Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7309624Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7310578Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:13.7310778Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7310963Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7311099Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7311249Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7311691Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:11:13.7311873Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.7312015Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7312193Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7312785Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:11:13.7313210Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.7313334Z U memchr@GLIBC_2.2.5 2025-05-07T20:11:13.7313439Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.7313540Z U 
memmove@GLIBC_2.2.5 2025-05-07T20:11:13.7313663Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.7313786Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.7313924Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.7314138Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:13.7314512Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.7315022Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.7315368Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:13.7316091Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7316498Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.7316887Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.7317065Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:11:13.7317215Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:11:13.7317379Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.7317505Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:13.7317643Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:13.7317807Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.7317949Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.7318127Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.7318284Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.7318522Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.7319130Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, 
char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7319335Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:13.7319492Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:11:13.7319679Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:11:13.7319838Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:11:13.7319968Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:13.7320100Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.7320249Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.7320373Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.7320572Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7320703Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.7321207Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7321337Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:11:13.7321534Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7321762Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.7321894Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.7322075Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7322209Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:13.7322431Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:11:13.7322638Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.7322770Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:13.7322876Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.7322976Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:13.7323067Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.7323184Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.7323843Z U 
torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.7324285Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.7324551Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.7325725Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:11:13.7326062Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:11:13.7326411Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.7326779Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:11:13.7326944Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:11:13.7327248Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:11:13.7327670Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:11:13.7327962Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:11:13.7328135Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:11:13.7328590Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:11:13.7328690Z U typeinfo for c10::Error 2025-05-07T20:11:13.7328819Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:13.7328946Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:11:13.7329067Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:13.7329204Z U typeinfo for std::runtime_error@GLIBCXX_3.4 
2025-05-07T20:11:13.7329382Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:13.7329581Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:13.7329740Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.7329892Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.7330042Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.7330153Z U vtable for c10::Error 2025-05-07T20:11:13.7330461Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.7330675Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.7330813Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:13.7330957Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.7331055Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.7331168Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.7331253Z w __gmon_start__ 2025-05-07T20:11:13.7331345Z w __pthread_key_create 2025-05-07T20:11:13.7331466Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.7331573Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:13.7331735Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.7331945Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:13.7331952Z 2025-05-07T20:11:13.7332048Z linux-vdso.so.1 (0x00007ffd862e2000) 2025-05-07T20:11:13.7332159Z libc10.so => not found 2025-05-07T20:11:13.7332249Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.7332348Z libc10_cuda.so => not found 2025-05-07T20:11:13.7332433Z libnccl.so.2 => not found 2025-05-07T20:11:13.7332517Z libcuda.so.1 => not found 2025-05-07T20:11:13.7332886Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f6667800000) 2025-05-07T20:11:13.7333309Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f6666000000) 
2025-05-07T20:11:13.7333713Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f6683067000) 2025-05-07T20:11:13.7333819Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.7333903Z libtorch.so => not found 2025-05-07T20:11:13.7333992Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7334080Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7334178Z libcudart.so.12 => not found 2025-05-07T20:11:13.7334352Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f6665d9c000) 2025-05-07T20:11:13.7334490Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f6683037000) 2025-05-07T20:11:13.7334615Z libc.so.6 => /lib64/libc.so.6 (0x00007f6665b94000) 2025-05-07T20:11:13.7334697Z libc10.so => not found 2025-05-07T20:11:13.7334784Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.7334871Z libc10_cuda.so => not found 2025-05-07T20:11:13.7334968Z libnccl.so.2 => not found 2025-05-07T20:11:13.7335053Z libcuda.so.1 => not found 2025-05-07T20:11:13.7335381Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f6682fbe000) 2025-05-07T20:11:13.7335486Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.7335572Z libtorch.so => not found 2025-05-07T20:11:13.7335662Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7335764Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7335873Z libm.so.6 => /lib64/libm.so.6 (0x00007f6667725000) 2025-05-07T20:11:13.7335993Z /lib64/ld-linux-x86-64.so.2 (0x00007f6683078000) 2025-05-07T20:11:13.7336080Z libtorch.so => not found 2025-05-07T20:11:13.7336170Z libc10.so => not found 2025-05-07T20:11:13.7336258Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.7336344Z libc10_cuda.so => not found 2025-05-07T20:11:13.7336441Z libnccl.so.2 => not found 2025-05-07T20:11:13.7336524Z libcuda.so.1 => not found 2025-05-07T20:11:13.7336616Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.7336704Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7336806Z 
libtorch_cuda.so => not found 2025-05-07T20:11:13.7336895Z libcudart.so.12 => not found 2025-05-07T20:11:13.7337032Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f6682f62000) 2025-05-07T20:11:13.7337129Z libtorch.so => not found 2025-05-07T20:11:13.7337209Z libc10.so => not found 2025-05-07T20:11:13.7337298Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.7337381Z libc10_cuda.so => not found 2025-05-07T20:11:13.7337475Z libnccl.so.2 => not found 2025-05-07T20:11:13.7337560Z libcuda.so.1 => not found 2025-05-07T20:11:13.7337649Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.7337745Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7337838Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7337998Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f6682f59000) 2025-05-07T20:11:13.7338085Z libtorch.so => not found 2025-05-07T20:11:13.7338176Z libc10.so => not found 2025-05-07T20:11:13.7338259Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.7338345Z libc10_cuda.so => not found 2025-05-07T20:11:13.7338444Z libnccl.so.2 => not found 2025-05-07T20:11:13.7338525Z libcuda.so.1 => not found 2025-05-07T20:11:13.7338617Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.7338705Z libtorch_cpu.so => not found 2025-05-07T20:11:13.7338830Z libtorch_cuda.so => not found 2025-05-07T20:11:13.7338954Z librt.so.1 => /lib64/librt.so.1 (0x00007f6682f50000) 2025-05-07T20:11:13.7338959Z 2025-05-07T20:11:13.7339058Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.7339317Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:11:13.7339322Z 2025-05-07T20:11:13.7351695Z 2025-05-07T20:11:13.7352307Z Dynamic section at offset 0x1ac7bfc8 contains 41 entries: 2025-05-07T20:11:13.7352769Z Tag Type Name/Value 2025-05-07T20:11:13.7353412Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.7354004Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.7354574Z 0x0000000000000001 (NEEDED) Shared 
library: [libc10_cuda.so] 2025-05-07T20:11:13.7355163Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.7355731Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.7356274Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:13.7356935Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:13.7359529Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:13.7359733Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.7359941Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.7360136Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.7360336Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.7360542Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:13.7360738Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.7360931Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.7361113Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.7361349Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:11:13.7361530Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.7361643Z 0x000000000000000c (INIT) 0x1a0000 2025-05-07T20:11:13.7361763Z 0x000000000000000d (FINI) 0x74838c 2025-05-07T20:11:13.7361883Z 0x0000000000000019 (INIT_ARRAY) 0x1ac7aca0 2025-05-07T20:11:13.7362007Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:11:13.7362135Z 0x000000000000001a (FINI_ARRAY) 0x1ac7ae28 2025-05-07T20:11:13.7362251Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.7362361Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.7362470Z 0x0000000000000005 (STRTAB) 0x27a50 2025-05-07T20:11:13.7362591Z 0x0000000000000006 (SYMTAB) 0x9db0 
2025-05-07T20:11:13.7362728Z 0x000000000000000a (STRSZ) 1387089 (bytes) 2025-05-07T20:11:13.7362844Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.7362974Z 0x0000000000000003 (PLTGOT) 0x1ac84fe8 2025-05-07T20:11:13.7363105Z 0x0000000000000002 (PLTRELSZ) 20568 (bytes) 2025-05-07T20:11:13.7363210Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.7363336Z 0x0000000000000017 (JMPREL) 0x19af18 2025-05-07T20:11:13.7363440Z 0x0000000000000007 (RELA) 0x17cd80 2025-05-07T20:11:13.7363568Z 0x0000000000000008 (RELASZ) 123288 (bytes) 2025-05-07T20:11:13.7363686Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.7363811Z 0x000000006ffffffe (VERNEED) 0x17cc60 2025-05-07T20:11:13.7363914Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:13.7364058Z 0x000000006ffffff0 (VERSYM) 0x17a4a2 2025-05-07T20:11:13.7364177Z 0x000000006ffffff9 (RELACOUNT) 539 2025-05-07T20:11:13.7364274Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.7364295Z 2025-05-07T20:11:13.7364409Z ################################################################################ 2025-05-07T20:11:13.7364440Z 2025-05-07T20:11:13.7364444Z 2025-05-07T20:11:13.7364566Z ################################################################################ 2025-05-07T20:11:13.7364931Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:13.7365033Z [CHECK] Listing out library size: 2025-05-07T20:11:13.7365378Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:13.7365382Z 2025-05-07T20:11:13.7375509Z 5 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:13.7375537Z 2025-05-07T20:11:13.7377284Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:13.7378304Z + objdump -TC 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.7378371Z 2025-05-07T20:11:13.7638345Z GLIBC_2.2.5 2025-05-07T20:11:13.7638568Z GLIBC_2.3 2025-05-07T20:11:13.7639261Z GLIBC_2.14 2025-05-07T20:11:13.7639278Z 2025-05-07T20:11:13.7639302Z 2025-05-07T20:11:13.7639853Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:13.7640493Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.7640501Z 2025-05-07T20:11:13.7902303Z GLIBCXX_3.4 2025-05-07T20:11:13.7902438Z GLIBCXX_3.4.9 2025-05-07T20:11:13.7902564Z GLIBCXX_3.4.11 2025-05-07T20:11:13.7902663Z GLIBCXX_3.4.15 2025-05-07T20:11:13.7902760Z GLIBCXX_3.4.18 2025-05-07T20:11:13.7902850Z GLIBCXX_3.4.20 2025-05-07T20:11:13.7902964Z GLIBCXX_3.4.21 2025-05-07T20:11:13.7902991Z 2025-05-07T20:11:13.7902996Z 2025-05-07T20:11:13.7924290Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.Yc04yDW9vf.symbols.txt 2025-05-07T20:11:13.7924307Z 2025-05-07T20:11:13.8142563Z 2025-05-07T20:11:13.8169134Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:11:13.8188290Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:11:13.8206586Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.RzB9g1mGsv.usymbols.txt 2025-05-07T20:11:13.8206614Z 2025-05-07T20:11:13.8232881Z 2025-05-07T20:11:13.8258689Z [CHECK] Listing out undefined symbols (189 total): 2025-05-07T20:11:13.8272549Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8273586Z U VTT for std::__cxx11::basic_stringstream, std::allocator 
>@GLIBCXX_3.4.21 2025-05-07T20:11:13.8274065Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:13.8274415Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:13.8274712Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:13.8275043Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:13.8275344Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:13.8275659Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:13.8276372Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:13.8276696Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:13.8276995Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:13.8277278Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:13.8277859Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:13.8278186Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:13.8278473Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:13.8278978Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:13.8279496Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:13.8279877Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:13.8280279Z U at::RecordFunction::end() 2025-05-07T20:11:13.8280657Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:13.8281085Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:13.8281790Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8282098Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:13.8282637Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8283302Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:13.8297096Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:13.8297373Z U 
at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:13.8297574Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:13.8297722Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:13.8297879Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:13.8298031Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:13.8298206Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:13.8298338Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:13.8298457Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:13.8298561Z U c10::AnyType::get() 2025-05-07T20:11:13.8298664Z U c10::BoolType::get() 2025-05-07T20:11:13.8298866Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:13.8298984Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:13.8299499Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:13.8300257Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:13.8300819Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.8300929Z U c10::Error::what() const 2025-05-07T20:11:13.8301058Z U c10::FloatType::get() 2025-05-07T20:11:13.8301171Z U c10::GradMode::is_enabled() 2025-05-07T20:11:13.8301303Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:13.8301489Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:13.8301611Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:13.8301730Z U c10::IValue::isBoolList() const 2025-05-07T20:11:13.8301964Z U c10::IValue::isIntList() const 2025-05-07T20:11:13.8302086Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:13.8302207Z U c10::IValue::isTensorList() const 2025-05-07T20:11:13.8302355Z U c10::IValue::reportToTensorTypeError() const 
2025-05-07T20:11:13.8302514Z U c10::IntType::get() 2025-05-07T20:11:13.8303029Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.8303229Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:13.8303357Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:13.8303491Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:13.8303621Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:13.8303872Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.8304162Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:13.8304272Z U c10::StringType::get() 2025-05-07T20:11:13.8304478Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:13.8304630Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:13.8304809Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:13.8304984Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:13.8305147Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:11:13.8305561Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:13.8305725Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:13.8305863Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:13.8306008Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:13.8306168Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:13.8306300Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:13.8306436Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:13.8306584Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:13.8306718Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:13.8306833Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:13.8306956Z U c10::SymIntType::get() 2025-05-07T20:11:13.8307085Z U 
c10::TensorImpl::requires_grad() const 2025-05-07T20:11:13.8307193Z U c10::TensorType::get() 2025-05-07T20:11:13.8307320Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:13.8307769Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:13.8308288Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:13.8308566Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:13.8309067Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.8309412Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:13.8310040Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:13.8310369Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:13.8310591Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:13.8310735Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:13.8310894Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:13.8311294Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:13.8311555Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:13.8311724Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:13.8311875Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:13.8312038Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:13.8312233Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:13.8312356Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:13.8312653Z U 
fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:13.8312751Z U free@GLIBC_2.2.5 2025-05-07T20:11:13.8312929Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:13.8313050Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:13.8313148Z U memcpy@GLIBC_2.14 2025-05-07T20:11:13.8313248Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:13.8313363Z U memset@GLIBC_2.2.5 2025-05-07T20:11:13.8313481Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:13.8313605Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:13.8313706Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:13.8313932Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:13.8314270Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:13.8314663Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.8315006Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:13.8315380Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:13.8315762Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:13.8315885Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:13.8316006Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:13.8316168Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8316316Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:13.8316487Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:13.8316636Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:13.8316950Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:13.8317199Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:13.8317803Z U std::basic_ostream >& 
std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8317963Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:13.8318089Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8318235Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:13.8318384Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:13.8318504Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:13.8318694Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8318975Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:13.8319107Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:13.8319280Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:13.8319434Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:13.8319869Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:13.8320017Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:13.8320152Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:13.8320279Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:13.8320381Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:13.8320526Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:13.8321123Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:13.8321592Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8321907Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:13.8322036Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:13.8322353Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:13.8322544Z U torch::autograd::AutogradContext::get_and_bump_dirty() 
const 2025-05-07T20:11:13.8322756Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:13.8322969Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:13.8323323Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:13.8323480Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:13.8323694Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:13.8323879Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:13.8324010Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:13.8324132Z U torch::autograd::Node::metadata() 2025-05-07T20:11:13.8324291Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:13.8324543Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:13.8324821Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:13.8324986Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:13.8325206Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:13.8325432Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:13.8328166Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:13.8328387Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:13.8328550Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:13.8328720Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:13.8329543Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, 
std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:13.8329732Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:13.8330147Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:13.8330534Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:13.8330649Z U typeinfo for c10::Error 2025-05-07T20:11:13.8330795Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:13.8330942Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:13.8331078Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:13.8331215Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:13.8331355Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:13.8331514Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8331680Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:13.8331860Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:13.8332020Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:13.8332127Z U vtable for c10::Error 2025-05-07T20:11:13.8332479Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:13.8332616Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:13.8332847Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:13.8332985Z U vtable for torch::autograd::Node 2025-05-07T20:11:13.8333166Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:13.8333287Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:13.8333414Z w _ITM_registerTMCloneTable 2025-05-07T20:11:13.8333521Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:13.8333620Z w __gmon_start__ 2025-05-07T20:11:13.8333721Z w __pthread_key_create 2025-05-07T20:11:13.8333853Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:13.8333970Z w pthread_mutex_unlock@GLIBC_2.2.5 
2025-05-07T20:11:13.8334122Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:13.8334431Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:13.8334439Z 2025-05-07T20:11:13.8334618Z linux-vdso.so.1 (0x00007ffe6c9f9000) 2025-05-07T20:11:13.8334717Z libc10.so => not found 2025-05-07T20:11:13.8334839Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.8334935Z libc10_cuda.so => not found 2025-05-07T20:11:13.8335060Z libnccl.so.2 => not found 2025-05-07T20:11:13.8335175Z libcuda.so.1 => not found 2025-05-07T20:11:13.8335634Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f9ea7304000) 2025-05-07T20:11:13.8336128Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f9ea5c00000) 2025-05-07T20:11:13.8336254Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.8336354Z libtorch.so => not found 2025-05-07T20:11:13.8336454Z libtorch_cpu.so => not found 2025-05-07T20:11:13.8336554Z libtorch_cuda.so => not found 2025-05-07T20:11:13.8336739Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f9ea599c000) 2025-05-07T20:11:13.8336897Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9ea72d4000) 2025-05-07T20:11:13.8337028Z libc.so.6 => /lib64/libc.so.6 (0x00007f9ea5794000) 2025-05-07T20:11:13.8337173Z /lib64/ld-linux-x86-64.so.2 (0x00007f9ea7315000) 2025-05-07T20:11:13.8337297Z libtorch.so => not found 2025-05-07T20:11:13.8337391Z libc10.so => not found 2025-05-07T20:11:13.8337491Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.8337603Z libc10_cuda.so => not found 2025-05-07T20:11:13.8337704Z libnccl.so.2 => not found 2025-05-07T20:11:13.8337799Z libcuda.so.1 => not found 2025-05-07T20:11:13.8337914Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.8338014Z libtorch_cpu.so => not found 2025-05-07T20:11:13.8338115Z libtorch_cuda.so => not found 
2025-05-07T20:11:13.8338270Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f9ea6daa000) 2025-05-07T20:11:13.8338466Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f9ea72cb000) 2025-05-07T20:11:13.8338566Z libtorch.so => not found 2025-05-07T20:11:13.8338658Z libc10.so => not found 2025-05-07T20:11:13.8338775Z libnvrtc.so.12 => not found 2025-05-07T20:11:13.8338873Z libc10_cuda.so => not found 2025-05-07T20:11:13.8338972Z libnccl.so.2 => not found 2025-05-07T20:11:13.8339069Z libcuda.so.1 => not found 2025-05-07T20:11:13.8339192Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:13.8339291Z libtorch_cpu.so => not found 2025-05-07T20:11:13.8339396Z libtorch_cuda.so => not found 2025-05-07T20:11:13.8339511Z libcudart.so.12 => not found 2025-05-07T20:11:13.8339644Z libm.so.6 => /lib64/libm.so.6 (0x00007f9ea56b9000) 2025-05-07T20:11:13.8339662Z 2025-05-07T20:11:13.8339779Z [CHECK] Displaying ELF information: 2025-05-07T20:11:13.8340210Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:11:13.8340217Z 2025-05-07T20:11:13.8374547Z 2025-05-07T20:11:13.8374928Z Dynamic section at offset 0x4b5fc8 contains 40 entries: 2025-05-07T20:11:13.8375070Z Tag Type Name/Value 2025-05-07T20:11:13.8375323Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:13.8375751Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:13.8376466Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:13.8376708Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:13.8376953Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:13.8377178Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:11:13.8377420Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:13.8377635Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:13.8377833Z 
0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:13.8378059Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:13.8378416Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:13.8378626Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:13.8378888Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:13.8379083Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:13.8379303Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:13.8379654Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:11:13.8379859Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:13.8379979Z 0x000000000000000c (INIT) 0xd6000 2025-05-07T20:11:13.8380204Z 0x000000000000000d (FINI) 0x3f64b8 2025-05-07T20:11:13.8380346Z 0x0000000000000019 (INIT_ARRAY) 0x4add80 2025-05-07T20:11:13.8380485Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:11:13.8380608Z 0x000000000000001a (FINI_ARRAY) 0x4adeb0 2025-05-07T20:11:13.8380756Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:13.8380936Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:13.8381097Z 0x0000000000000005 (STRTAB) 0x16e00 2025-05-07T20:11:13.8381216Z 0x0000000000000006 (SYMTAB) 0x55e0 2025-05-07T20:11:13.8381372Z 0x000000000000000a (STRSZ) 609767 (bytes) 2025-05-07T20:11:13.8381498Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:13.8381623Z 0x0000000000000003 (PLTGOT) 0x4b8fe8 2025-05-07T20:11:13.8381783Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:11:13.8381896Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:13.8382011Z 0x0000000000000017 (JMPREL) 0xcdaf0 2025-05-07T20:11:13.8382140Z 0x0000000000000007 (RELA) 0xad450 2025-05-07T20:11:13.8382281Z 0x0000000000000008 (RELASZ) 132768 (bytes) 2025-05-07T20:11:13.8382404Z 
0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:13.8382527Z 0x000000006ffffffe (VERNEED) 0xad340 2025-05-07T20:11:13.8382658Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:11:13.8382781Z 0x000000006ffffff0 (VERSYM) 0xabbe8 2025-05-07T20:11:13.8382893Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:11:13.8383011Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:13.8383018Z 2025-05-07T20:11:13.8383142Z ################################################################################ 2025-05-07T20:11:13.8383147Z 2025-05-07T20:11:13.8383152Z 2025-05-07T20:11:13.8383268Z ################################################################################ 2025-05-07T20:11:13.8383596Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:13.8383704Z [CHECK] Listing out library size: 2025-05-07T20:11:13.8384005Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:13.8384010Z 2025-05-07T20:11:13.8391624Z 339 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:13.8391668Z 2025-05-07T20:11:13.8392911Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:13.8394452Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.8394469Z 2025-05-07T20:11:13.9348776Z GLIBC_2.2.5 2025-05-07T20:11:13.9349726Z GLIBC_2.3 2025-05-07T20:11:13.9349999Z GLIBC_2.14 2025-05-07T20:11:13.9350167Z 2025-05-07T20:11:13.9350171Z 2025-05-07T20:11:13.9350635Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:13.9351983Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | 
sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:13.9352645Z 2025-05-07T20:11:14.0301708Z GLIBCXX_3.4 2025-05-07T20:11:14.0302732Z GLIBCXX_3.4.9 2025-05-07T20:11:14.0303364Z GLIBCXX_3.4.20 2025-05-07T20:11:14.0303944Z GLIBCXX_3.4.21 2025-05-07T20:11:14.0304315Z 2025-05-07T20:11:14.0304329Z 2025-05-07T20:11:14.0322155Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.HnTM53GRNx.symbols.txt 2025-05-07T20:11:14.0322720Z 2025-05-07T20:11:14.1249192Z 2025-05-07T20:11:14.1291660Z [CHECK] Total Number of symbols: 12626 2025-05-07T20:11:14.1340226Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:11:14.1357428Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.HEz18rylQB.usymbols.txt 2025-05-07T20:11:14.1357994Z 2025-05-07T20:11:14.1409740Z 2025-05-07T20:11:14.1432188Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:11:14.1449612Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.1451483Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.1452848Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.1454012Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.1455107Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.1456215Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.1457283Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.1458320Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.1459343Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.1459698Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.1460027Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.1460659Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.1461019Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.1461353Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.1461705Z U __cxa_guard_abort@CXXABI_1.3 
2025-05-07T20:11:14.1462026Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.1462368Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.1462691Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.1463016Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.1463347Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.1463668Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.1464070Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.1464490Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:14.1465052Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:14.1465782Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:11:14.1466421Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:14.1467141Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:11:14.1468110Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.1469022Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.1469498Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.1470059Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:14.1470513Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.1470961Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.1471480Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1471890Z U c10::BoolType::get() 2025-05-07T20:11:14.1472410Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.1472861Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.1473273Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.1473971Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, 
std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.1475164Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:14.1476611Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.1477254Z U c10::Error::what() const 2025-05-07T20:11:14.1477587Z U c10::FloatType::get() 2025-05-07T20:11:14.1477949Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.1478409Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1478845Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.1479219Z U c10::IntType::get() 2025-05-07T20:11:14.1479608Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.1480019Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.1480396Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.1480760Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.1481165Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:14.1481594Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.1482002Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:14.1482776Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.1483382Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.1483745Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:11:14.1484114Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:14.1484455Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.1484805Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:14.1485152Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:11:14.1485508Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:14.1485856Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:14.1486212Z U 
c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:14.1486544Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.1486844Z U c10::SymIntType::get() 2025-05-07T20:11:14.1487145Z U c10::TensorType::get() 2025-05-07T20:11:14.1487445Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.1488393Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.1489295Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.1489635Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.1489978Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.1490327Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.1490660Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.1490976Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.1491463Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.1491913Z U c10::cuda::device_count() 2025-05-07T20:11:14.1492235Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.1492604Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.1492971Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.1493356Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.1493746Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.1494145Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.1494850Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.1495663Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.1496481Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.1497370Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 
2025-05-07T20:11:14.1498334Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.1499290Z U c10::get_default_dtype() 2025-05-07T20:11:14.1499623Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.1499949Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.1500782Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.1501477Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.1501945Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.1502309Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.1502693Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.1503102Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:11:14.1503457Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:11:14.1503825Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:14.1504191Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:14.1504569Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.1504973Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.1505392Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.1505821Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:14.1506182Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.1506606Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.1507050Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.1507415Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.1507837Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.1508188Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.1508655Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.1509007Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.1509343Z U 
cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.1509683Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.1510027Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.1510355Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.1510669Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.1511015Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.1511349Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.1511848Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.1512343Z U float at::Tensor::item() const 2025-05-07T20:11:14.1512685Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.1513118Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1513458Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.1513748Z U int at::Tensor::item() const 2025-05-07T20:11:14.1514071Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.1514437Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1514851Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.1515241Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.1515620Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1515955Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.1516236Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.1516508Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.1516812Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:14.1517157Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.1517694Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.1518484Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.1519055Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.1519409Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.1519792Z U std::__throw_logic_error(char 
const*)@GLIBCXX_3.4 2025-05-07T20:11:14.1520184Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.1520690Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.1521572Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.1522328Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.1522677Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.1523001Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.1523341Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.1523666Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.1524060Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.1524612Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.1525059Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.1525400Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.1525692Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.1526035Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.1526839Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.1527915Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.1528691Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.1529380Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.1529919Z U typeinfo for c10::Error 2025-05-07T20:11:14.1530269Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.1530691Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.1531103Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.1531460Z 
U vtable for c10::Error 2025-05-07T20:11:14.1531959Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.1532593Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.1533074Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.1533459Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.1533782Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.1534088Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.1534390Z w __gmon_start__ 2025-05-07T20:11:14.1534643Z w __pthread_key_create 2025-05-07T20:11:14.1534978Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.1535426Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.1535768Z 2025-05-07T20:11:14.1535904Z linux-vdso.so.1 (0x00007ffe251f0000) 2025-05-07T20:11:14.1536194Z libc10.so => not found 2025-05-07T20:11:14.1536426Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.1536696Z libc10_cuda.so => not found 2025-05-07T20:11:14.1536943Z libnccl.so.2 => not found 2025-05-07T20:11:14.1537206Z libcuda.so.1 => not found 2025-05-07T20:11:14.1537801Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f8fe2600000) 2025-05-07T20:11:14.1538445Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.1538702Z libtorch.so => not found 2025-05-07T20:11:14.1538960Z libtorch_cpu.so => not found 2025-05-07T20:11:14.1539212Z libtorch_cuda.so => not found 2025-05-07T20:11:14.1539472Z libcudart.so.12 => not found 2025-05-07T20:11:14.1539797Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f8fe239c000) 2025-05-07T20:11:14.1540295Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f8ff858b000) 2025-05-07T20:11:14.1540870Z libc.so.6 => /lib64/libc.so.6 (0x00007f8fe2194000) 2025-05-07T20:11:14.1541311Z /lib64/ld-linux-x86-64.so.2 (0x00007f8ff85c1000) 2025-05-07T20:11:14.1541655Z libc10.so => not found 
2025-05-07T20:11:14.1541902Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.1542184Z libc10_cuda.so => not found 2025-05-07T20:11:14.1542446Z libnccl.so.2 => not found 2025-05-07T20:11:14.1542726Z libcuda.so.1 => not found 2025-05-07T20:11:14.1543262Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f8fe1c00000) 2025-05-07T20:11:14.1544223Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f8ff857e000) 2025-05-07T20:11:14.1544892Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.1545165Z libtorch.so => not found 2025-05-07T20:11:14.1545501Z libtorch_cpu.so => not found 2025-05-07T20:11:14.1545770Z libtorch_cuda.so => not found 2025-05-07T20:11:14.1546067Z libcudart.so.12 => not found 2025-05-07T20:11:14.1546406Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f8ff8526000) 2025-05-07T20:11:14.1546938Z libm.so.6 => /lib64/libm.so.6 (0x00007f8fe2925000) 2025-05-07T20:11:14.1547255Z libc10.so => not found 2025-05-07T20:11:14.1547484Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.1547745Z libc10_cuda.so => not found 2025-05-07T20:11:14.1547987Z libnccl.so.2 => not found 2025-05-07T20:11:14.1548238Z libcuda.so.1 => not found 2025-05-07T20:11:14.1548719Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f8fe1b89000) 2025-05-07T20:11:14.1549264Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.1549513Z libtorch.so => not found 2025-05-07T20:11:14.1549764Z libtorch_cpu.so => not found 2025-05-07T20:11:14.1550025Z libtorch_cuda.so => not found 2025-05-07T20:11:14.1550301Z libtorch.so => not found 2025-05-07T20:11:14.1550531Z libc10.so => not found 2025-05-07T20:11:14.1550759Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.1551008Z libc10_cuda.so => not found 2025-05-07T20:11:14.1551251Z libnccl.so.2 => not found 2025-05-07T20:11:14.1551478Z libcuda.so.1 => not found 2025-05-07T20:11:14.1551722Z 
libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.1551966Z libtorch_cpu.so => not found 2025-05-07T20:11:14.1552226Z libtorch_cuda.so => not found 2025-05-07T20:11:14.1552543Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f8ff8519000) 2025-05-07T20:11:14.1552906Z libtorch.so => not found 2025-05-07T20:11:14.1553128Z libc10.so => not found 2025-05-07T20:11:14.1553364Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.1553610Z libc10_cuda.so => not found 2025-05-07T20:11:14.1553841Z libnccl.so.2 => not found 2025-05-07T20:11:14.1554083Z libcuda.so.1 => not found 2025-05-07T20:11:14.1554316Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.1554573Z libtorch_cpu.so => not found 2025-05-07T20:11:14.1554810Z libtorch_cuda.so => not found 2025-05-07T20:11:14.1555098Z librt.so.1 => /lib64/librt.so.1 (0x00007f8ff8510000) 2025-05-07T20:11:14.1555322Z 2025-05-07T20:11:14.1555423Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.1555848Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:11:14.1556191Z 2025-05-07T20:11:14.1559980Z 2025-05-07T20:11:14.1560477Z Dynamic section at offset 0x15292018 contains 40 entries: 2025-05-07T20:11:14.1561570Z Tag Type Name/Value 2025-05-07T20:11:14.1562768Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.1564235Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:14.1565738Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.1566820Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:14.1567322Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:14.1567847Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:14.1568384Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:14.1568906Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.1569403Z 
0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.1569921Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.1570438Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.1570998Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.1571531Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.1572021Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.1572589Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.1573276Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:11:14.1574030Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.1574437Z 0x000000000000000c (INIT) 0x453000 2025-05-07T20:11:14.1574772Z 0x000000000000000d (FINI) 0x1fe941c 2025-05-07T20:11:14.1575117Z 0x0000000000000019 (INIT_ARRAY) 0x152889a8 2025-05-07T20:11:14.1575465Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:11:14.1575824Z 0x000000000000001a (FINI_ARRAY) 0x15288c98 2025-05-07T20:11:14.1576528Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.1576867Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:11:14.1577212Z 0x0000000000000005 (STRTAB) 0x624b8 2025-05-07T20:11:14.1577537Z 0x0000000000000006 (SYMTAB) 0x184f0 2025-05-07T20:11:14.1577999Z 0x000000000000000a (STRSZ) 3694099 (bytes) 2025-05-07T20:11:14.1578364Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.1578720Z 0x0000000000000003 (PLTGOT) 0x152a8fe8 2025-05-07T20:11:14.1579093Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:11:14.1579436Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.1579769Z 0x0000000000000017 (JMPREL) 0x44ece0 2025-05-07T20:11:14.1580194Z 0x0000000000000007 (RELA) 0x3ee668 2025-05-07T20:11:14.1580561Z 0x0000000000000008 (RELASZ) 394872 (bytes) 2025-05-07T20:11:14.1580916Z 
0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.1581268Z 0x000000006ffffffe (VERNEED) 0x3ee578 2025-05-07T20:11:14.1581601Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:14.1581925Z 0x000000006ffffff0 (VERSYM) 0x3e82cc 2025-05-07T20:11:14.1582262Z 0x000000006ffffff9 (RELACOUNT) 1976 2025-05-07T20:11:14.1582575Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.1582774Z 2025-05-07T20:11:14.1582895Z ################################################################################ 2025-05-07T20:11:14.1583122Z 2025-05-07T20:11:14.1583128Z 2025-05-07T20:11:14.1583244Z ################################################################################ 2025-05-07T20:11:14.1583775Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.1584301Z [CHECK] Listing out library size: 2025-05-07T20:11:14.1584785Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.1585185Z 2025-05-07T20:11:14.1585435Z 1 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.1585771Z 2025-05-07T20:11:14.1586193Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.1587249Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.1587871Z 2025-05-07T20:11:14.1636738Z GLIBC_2.2.5 2025-05-07T20:11:14.1637099Z GLIBC_2.3 2025-05-07T20:11:14.1637316Z GLIBC_2.14 2025-05-07T20:11:14.1637430Z 2025-05-07T20:11:14.1637444Z 2025-05-07T20:11:14.1638078Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.1639518Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep 
GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.1640159Z 2025-05-07T20:11:14.1697396Z GLIBCXX_3.4 2025-05-07T20:11:14.1698051Z GLIBCXX_3.4.9 2025-05-07T20:11:14.1698505Z GLIBCXX_3.4.18 2025-05-07T20:11:14.1698736Z GLIBCXX_3.4.20 2025-05-07T20:11:14.1699162Z GLIBCXX_3.4.21 2025-05-07T20:11:14.1699287Z 2025-05-07T20:11:14.1699291Z 2025-05-07T20:11:14.1717642Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.Nnmcdz8yeR.symbols.txt 2025-05-07T20:11:14.1718200Z 2025-05-07T20:11:14.1742409Z 2025-05-07T20:11:14.1768626Z [CHECK] Total Number of symbols: 357 2025-05-07T20:11:14.1777764Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:11:14.1797932Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.TNsRaxKTsz.usymbols.txt 2025-05-07T20:11:14.1798498Z 2025-05-07T20:11:14.1813785Z 2025-05-07T20:11:14.1838305Z [CHECK] Listing out undefined symbols (118 total): 2025-05-07T20:11:14.1860272Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.1861146Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.1861897Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.1862240Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.1862635Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.1863007Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.1863388Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.1863747Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.1864104Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.1864474Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.1864807Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.1865126Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.1865439Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.1865759Z U 
__cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.1866090Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.1866441Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.1866761Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.1867097Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:14.1867455Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.1867779Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.1868605Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.1869949Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.1870897Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.1871347Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.1871707Z U c10::IntType::get() 2025-05-07T20:11:14.1872099Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.1872512Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.1873131Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.1873966Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.1874573Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.1875005Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.1875317Z U c10::TensorType::get() 2025-05-07T20:11:14.1875638Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.1878305Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.1879327Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.1879713Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.1880076Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.1880422Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.1880783Z U c10::cuda::MaybeSetDevice(signed char) 
2025-05-07T20:11:14.1881129Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.1881621Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.1882093Z U c10::cuda::device_count() 2025-05-07T20:11:14.1882459Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.1882901Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.1883292Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.1883710Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.1884121Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.1884529Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.1885285Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.1886177Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.1887064Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.1888201Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.1889488Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.1890239Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.1890571Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.1890891Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.1891258Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.1891635Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.1891993Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.1892402Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.1892817Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.1893183Z U 
cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.1893523Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.1893869Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.1894191Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.1894519Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.1894862Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.1895206Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:14.1895563Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.1895926Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.1896262Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.1896575Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.1896956Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.1897308Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.1897659Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1898106Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.1898518Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1898870Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.1899145Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.1899432Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.1899719Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:14.1900147Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.1900903Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.1901750Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.1902630Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:14.1903462Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.1904066Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.1904424Z U std::__throw_bad_array_new_length() 
2025-05-07T20:11:14.1904789Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.1905204Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.1905642Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.1906171Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.1907127Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.1907955Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.1908312Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.1908685Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.1909035Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.1909460Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.1910005Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.1910507Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.1910871Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.1911189Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.1911519Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.1912340Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.1913521Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.1914366Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.1915213Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.1915949Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.1916514Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.1916966Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 
2025-05-07T20:11:14.1917454Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.1918189Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.1918864Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.1919340Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.1919668Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.1920014Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.1920320Z w __gmon_start__ 2025-05-07T20:11:14.1920625Z w __pthread_key_create 2025-05-07T20:11:14.1920971Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.1921513Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.1921856Z 2025-05-07T20:11:14.1922016Z linux-vdso.so.1 (0x00007ffef8758000) 2025-05-07T20:11:14.1922322Z libtorch.so => not found 2025-05-07T20:11:14.1922617Z libc10.so => not found 2025-05-07T20:11:14.1922877Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.1923182Z libc10_cuda.so => not found 2025-05-07T20:11:14.1923449Z libnccl.so.2 => not found 2025-05-07T20:11:14.1923737Z libcuda.so.1 => not found 2025-05-07T20:11:14.1924005Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.1924316Z libtorch_cpu.so => not found 2025-05-07T20:11:14.1924628Z libtorch_cuda.so => not found 2025-05-07T20:11:14.1924904Z libcudart.so.12 => not found 2025-05-07T20:11:14.1925263Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f792f8bb000) 2025-05-07T20:11:14.1925670Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f792f865000) 2025-05-07T20:11:14.1926090Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f792f837000) 2025-05-07T20:11:14.1926462Z libc.so.6 => /lib64/libc.so.6 (0x00007f792f62f000) 2025-05-07T20:11:14.1926842Z /lib64/ld-linux-x86-64.so.2 (0x00007f792fb9a000) 2025-05-07T20:11:14.1927190Z libm.so.6 => /lib64/libm.so.6 (0x00007f792f554000) 2025-05-07T20:11:14.1927435Z 2025-05-07T20:11:14.1927546Z [CHECK] Displaying ELF information: 
2025-05-07T20:11:14.1928016Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:11:14.1928378Z 2025-05-07T20:11:14.1936325Z 2025-05-07T20:11:14.1936526Z Dynamic section at offset 0x71b10 contains 39 entries: 2025-05-07T20:11:14.1936972Z Tag Type Name/Value 2025-05-07T20:11:14.1937416Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.1937979Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.1938521Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:14.1939067Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.1939602Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:14.1940232Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:14.1940758Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:14.1941300Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.1941821Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.1942358Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.1942935Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.1943466Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:14.1943991Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.1944526Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.1945064Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.1945652Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:11:14.1946188Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:11:14.1946527Z 0x000000000000000d (FINI) 0x316ac 2025-05-07T20:11:14.1946876Z 0x0000000000000019 (INIT_ARRAY) 0x71130 2025-05-07T20:11:14.1947238Z 
0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:11:14.1947587Z 0x000000000000001a (FINI_ARRAY) 0x71158 2025-05-07T20:11:14.1947951Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.1948302Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:14.1948653Z 0x0000000000000005 (STRTAB) 0x2ba8 2025-05-07T20:11:14.1948980Z 0x0000000000000006 (SYMTAB) 0xa18 2025-05-07T20:11:14.1949374Z 0x000000000000000a (STRSZ) 36157 (bytes) 2025-05-07T20:11:14.1949734Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.1950090Z 0x0000000000000003 (PLTGOT) 0x71fe8 2025-05-07T20:11:14.1950466Z 0x0000000000000002 (PLTRELSZ) 5520 (bytes) 2025-05-07T20:11:14.1950816Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.1951155Z 0x0000000000000017 (JMPREL) 0xdfa8 2025-05-07T20:11:14.1951486Z 0x0000000000000007 (RELA) 0xbcc8 2025-05-07T20:11:14.1951847Z 0x0000000000000008 (RELASZ) 8928 (bytes) 2025-05-07T20:11:14.1952204Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.1952569Z 0x000000006ffffffe (VERNEED) 0xbbb8 2025-05-07T20:11:14.1952902Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:14.1953243Z 0x000000006ffffff0 (VERSYM) 0xb8e6 2025-05-07T20:11:14.1953588Z 0x000000006ffffff9 (RELACOUNT) 162 2025-05-07T20:11:14.1953903Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.1954106Z 2025-05-07T20:11:14.1954239Z ################################################################################ 2025-05-07T20:11:14.1954469Z 2025-05-07T20:11:14.1954474Z 2025-05-07T20:11:14.1954594Z ################################################################################ 2025-05-07T20:11:14.1955118Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.1955627Z [CHECK] Listing out library size: 2025-05-07T20:11:14.1956083Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.1956463Z 2025-05-07T20:11:14.1956701Z 35 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.1957017Z 2025-05-07T20:11:14.1957416Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.1958421Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.1958430Z 2025-05-07T20:11:14.2066031Z GLIBC_2.2.5 2025-05-07T20:11:14.2066970Z GLIBC_2.3 2025-05-07T20:11:14.2067218Z GLIBC_2.14 2025-05-07T20:11:14.2067236Z 2025-05-07T20:11:14.2067241Z 2025-05-07T20:11:14.2067700Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.2069429Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.2069439Z 2025-05-07T20:11:14.2181584Z GLIBCXX_3.4 2025-05-07T20:11:14.2181871Z GLIBCXX_3.4.9 2025-05-07T20:11:14.2181984Z GLIBCXX_3.4.11 2025-05-07T20:11:14.2182072Z GLIBCXX_3.4.15 2025-05-07T20:11:14.2182157Z GLIBCXX_3.4.18 2025-05-07T20:11:14.2182250Z GLIBCXX_3.4.20 2025-05-07T20:11:14.2182392Z GLIBCXX_3.4.21 2025-05-07T20:11:14.2182408Z 2025-05-07T20:11:14.2182638Z 2025-05-07T20:11:14.2203659Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.ySDupUgAf2.symbols.txt 2025-05-07T20:11:14.2203692Z 2025-05-07T20:11:14.2288068Z 2025-05-07T20:11:14.2311351Z [CHECK] Total Number of symbols: 1545 2025-05-07T20:11:14.2328059Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:11:14.2342910Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.Q67JoGRoGm.usymbols.txt 2025-05-07T20:11:14.2342933Z 2025-05-07T20:11:14.2366351Z 2025-05-07T20:11:14.2389163Z [CHECK] Listing out undefined symbols 
(266 total): 2025-05-07T20:11:14.2405073Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2405614Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2406236Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.2406518Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.2406669Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.2406818Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.2406966Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.2407097Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.2407221Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.2407375Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.2407500Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.2407614Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.2407731Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.2407861Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.2407976Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.2408086Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.2408210Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.2408317Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.2408430Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:14.2408526Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.2408661Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:14.2408763Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.2408871Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.2409000Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.2409119Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:14.2409264Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:14.2409460Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.2409593Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:14.2409720Z U at::RecordFunction::~RecordFunction() 
2025-05-07T20:11:14.2409891Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:14.2410087Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:14.2410199Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:14.2410332Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:14.2410512Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:11:14.2410673Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:14.2411459Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2412110Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2412313Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.2412500Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.2412681Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.2412834Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.2413143Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.2413340Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:14.2413456Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:14.2413671Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.2413867Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:11:14.2414028Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:14.2414272Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:14.2414573Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 
2025-05-07T20:11:14.2415162Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:14.2415350Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.2415514Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.2415973Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2416518Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.2416651Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:14.2416778Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:14.2416924Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.2417027Z U at::globalContext() 2025-05-07T20:11:14.2417173Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:11:14.2417299Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:14.2417393Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:14.2417503Z U bool at::Tensor::item() const 2025-05-07T20:11:14.2417614Z U c10::AnyType::get() 2025-05-07T20:11:14.2417776Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:14.2417963Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2418076Z U c10::BoolType::get() 2025-05-07T20:11:14.2418235Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.2418403Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.2418616Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.2419101Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.2419711Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 
2025-05-07T20:11:14.2420223Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.2420336Z U c10::Error::what() const 2025-05-07T20:11:14.2420438Z U c10::GradMode::is_enabled() 2025-05-07T20:11:14.2420747Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:14.2420932Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2421090Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:14.2421228Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:14.2421352Z U c10::IValue::isBoolList() const 2025-05-07T20:11:14.2421526Z U c10::IValue::isIntList() const 2025-05-07T20:11:14.2421655Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:14.2421769Z U c10::IValue::isTensorList() const 2025-05-07T20:11:14.2421914Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.2422055Z U c10::IntType::get() 2025-05-07T20:11:14.2422536Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.2422709Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.2422861Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.2422990Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.2423119Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.2423430Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:14.2423594Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.2423705Z U c10::StringType::get() 2025-05-07T20:11:14.2423890Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.2424292Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.2424428Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.2424568Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.2424678Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.2424781Z U 
c10::SymIntType::get() 2025-05-07T20:11:14.2424938Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:14.2425090Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:14.2425536Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:14.2425690Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.2425818Z U c10::TensorType::get() 2025-05-07T20:11:14.2426014Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:14.2426126Z U c10::Type::is_module() const 2025-05-07T20:11:14.2426277Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.2427139Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.2427267Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.2428636Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.2428758Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.2428875Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:14.2429055Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.2429166Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.2429398Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.2429518Z U c10::cuda::device_count() 2025-05-07T20:11:14.2429650Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.2429782Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.2429941Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.2430074Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.2430224Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.2430375Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.2430781Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.2431267Z U 
c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.2431521Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.2431986Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.2432305Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.2432859Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.2433121Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:14.2433386Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:14.2433574Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:14.2433687Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.2433812Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.2434112Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.2434283Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.2434452Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:14.2434610Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:14.2434727Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.2434849Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.2435013Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.2435365Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.2435511Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.2435684Z U c10::operator<<(std::ostream&, c10::DeviceType) 
2025-05-07T20:11:14.2435838Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.2435941Z U c10::throwNullDataPtrError() 2025-05-07T20:11:14.2436105Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:14.2436203Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.2436309Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:14.2436532Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.2436641Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:14.2436767Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:14.2436908Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.2437030Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.2437137Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.2437263Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.2437390Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.2437499Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.2437646Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.2437777Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:14.2437909Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:14.2438025Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.2438146Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.2438254Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.2438365Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.2438533Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.2438643Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.2438829Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:14.2438994Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2439091Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.2439237Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2439342Z U log2f@GLIBC_2.2.5 2025-05-07T20:11:14.2439454Z U long at::Tensor::item() const 
2025-05-07T20:11:14.2439625Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.2439765Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.2439913Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.2440010Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:14.2440107Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.2440218Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.2440313Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.2440431Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:14.2440556Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.2440651Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:14.2440860Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:14.2441189Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.2441564Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.2441875Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:14.2442272Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.2442387Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.2442503Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.2442650Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.2442817Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.2442982Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.2443147Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:14.2443281Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:14.2443511Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.2444065Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 
2025-05-07T20:11:14.2444197Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.2444315Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.2444441Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.2444606Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.2444724Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.2444935Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.2445161Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.2445285Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.2445460Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.2445590Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:14.2445766Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.2446187Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:14.2446328Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:14.2446436Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.2446551Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:14.2446646Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.2446770Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.2447543Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.2448003Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.2448260Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.2448406Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:14.2448881Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:14.2449072Z U 
torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:14.2449308Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:14.2449506Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:14.2449860Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:14.2450038Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:14.2450263Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:14.2450447Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:14.2450592Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:14.2450772Z U torch::autograd::Node::metadata() 2025-05-07T20:11:14.2450935Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:14.2451275Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.2451546Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:14.2451694Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:14.2451934Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:14.2452163Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:14.2454865Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:14.2455063Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:14.2455228Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:14.2455420Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:14.2455580Z U 
torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:14.2456000Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:14.2456393Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.2456949Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:14.2457083Z U typeinfo for c10::Error 2025-05-07T20:11:14.2457190Z U typeinfo for c10::Type 2025-05-07T20:11:14.2457341Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.2457614Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:14.2457866Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:14.2457986Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:14.2458132Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.2458300Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.2458451Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:14.2458600Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.2458766Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.2458865Z U vtable for c10::Error 2025-05-07T20:11:14.2458964Z U vtable for c10::ListType 2025-05-07T20:11:14.2459318Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.2459449Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.2459695Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.2459835Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:14.2459947Z U vtable for torch::autograd::Node 2025-05-07T20:11:14.2460242Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.2460370Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.2460473Z w _ITM_registerTMCloneTable 
2025-05-07T20:11:14.2460755Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.2460874Z w __gmon_start__ 2025-05-07T20:11:14.2460975Z w __pthread_key_create 2025-05-07T20:11:14.2461092Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:14.2461207Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:14.2461371Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.2461649Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.2461689Z 2025-05-07T20:11:14.2461846Z linux-vdso.so.1 (0x00007ffe925c4000) 2025-05-07T20:11:14.2461938Z libc10.so => not found 2025-05-07T20:11:14.2462041Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.2462141Z libc10_cuda.so => not found 2025-05-07T20:11:14.2462253Z libnccl.so.2 => not found 2025-05-07T20:11:14.2462349Z libcuda.so.1 => not found 2025-05-07T20:11:14.2462910Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f9169e59000) 2025-05-07T20:11:14.2463395Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f9168c00000) 2025-05-07T20:11:14.2463502Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.2463598Z libtorch.so => not found 2025-05-07T20:11:14.2463698Z libtorch_cpu.so => not found 2025-05-07T20:11:14.2463819Z libtorch_cuda.so => not found 2025-05-07T20:11:14.2463922Z libcudart.so.12 => not found 2025-05-07T20:11:14.2464087Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f916899c000) 2025-05-07T20:11:14.2464233Z libm.so.6 => /lib64/libm.so.6 (0x00007f916c3dc000) 2025-05-07T20:11:14.2464390Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f916c3ae000) 2025-05-07T20:11:14.2464515Z libc.so.6 => /lib64/libc.so.6 (0x00007f9168794000) 2025-05-07T20:11:14.2464661Z /lib64/ld-linux-x86-64.so.2 (0x00007f916c4bf000) 2025-05-07T20:11:14.2464754Z libc10.so => not found 2025-05-07T20:11:14.2464855Z 
libnvrtc.so.12 => not found 2025-05-07T20:11:14.2464949Z libc10_cuda.so => not found 2025-05-07T20:11:14.2465058Z libnccl.so.2 => not found 2025-05-07T20:11:14.2465153Z libcuda.so.1 => not found 2025-05-07T20:11:14.2465256Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.2465367Z libtorch.so => not found 2025-05-07T20:11:14.2465464Z libtorch_cpu.so => not found 2025-05-07T20:11:14.2465566Z libtorch_cuda.so => not found 2025-05-07T20:11:14.2465669Z libcudart.so.12 => not found 2025-05-07T20:11:14.2465780Z libtorch.so => not found 2025-05-07T20:11:14.2465869Z libc10.so => not found 2025-05-07T20:11:14.2465969Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.2466082Z libc10_cuda.so => not found 2025-05-07T20:11:14.2466177Z libnccl.so.2 => not found 2025-05-07T20:11:14.2466275Z libcuda.so.1 => not found 2025-05-07T20:11:14.2466378Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.2466494Z libtorch_cpu.so => not found 2025-05-07T20:11:14.2466594Z libtorch_cuda.so => not found 2025-05-07T20:11:14.2466693Z libcudart.so.12 => not found 2025-05-07T20:11:14.2466861Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f9169e03000) 2025-05-07T20:11:14.2466867Z 2025-05-07T20:11:14.2469688Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.2469968Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:11:14.2469974Z 2025-05-07T20:11:14.2484688Z 2025-05-07T20:11:14.2485503Z Dynamic section at offset 0x220d958 contains 42 entries: 2025-05-07T20:11:14.2485902Z Tag Type Name/Value 2025-05-07T20:11:14.2486131Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.2486382Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:14.2486598Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.2486808Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:14.2487040Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 
2025-05-07T20:11:14.2487307Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:14.2487614Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:14.2487838Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:14.2488047Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.2488386Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.2488606Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.2488823Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.2489064Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.2489267Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:14.2489477Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.2489712Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.2489948Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.2490201Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:11:14.2490429Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.2490570Z 0x000000000000000c (INIT) 0x56000 2025-05-07T20:11:14.2490699Z 0x000000000000000d (FINI) 0x1515ac 2025-05-07T20:11:14.2490832Z 0x0000000000000019 (INIT_ARRAY) 0x220b430 2025-05-07T20:11:14.2490996Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:11:14.2491150Z 0x000000000000001a (FINI_ARRAY) 0x220b4c0 2025-05-07T20:11:14.2491283Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.2491436Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:14.2491562Z 0x0000000000000005 (STRTAB) 0xbb50 2025-05-07T20:11:14.2491682Z 0x0000000000000006 (SYMTAB) 0x2a60 2025-05-07T20:11:14.2491860Z 0x000000000000000a (STRSZ) 242227 (bytes) 
2025-05-07T20:11:14.2491993Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.2492128Z 0x0000000000000003 (PLTGOT) 0x220efe8 2025-05-07T20:11:14.2492309Z 0x0000000000000002 (PLTRELSZ) 16872 (bytes) 2025-05-07T20:11:14.2492432Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.2492554Z 0x0000000000000017 (JMPREL) 0x512d8 2025-05-07T20:11:14.2492675Z 0x0000000000000007 (RELA) 0x47af8 2025-05-07T20:11:14.2492840Z 0x0000000000000008 (RELASZ) 38880 (bytes) 2025-05-07T20:11:14.2492970Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.2493098Z 0x000000006ffffffe (VERNEED) 0x47998 2025-05-07T20:11:14.2493242Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:14.2493368Z 0x000000006ffffff0 (VERSYM) 0x46d84 2025-05-07T20:11:14.2493491Z 0x000000006ffffff9 (RELACOUNT) 571 2025-05-07T20:11:14.2493766Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.2493801Z 2025-05-07T20:11:14.2493937Z ################################################################################ 2025-05-07T20:11:14.2493942Z 2025-05-07T20:11:14.2493980Z 2025-05-07T20:11:14.2494105Z ################################################################################ 2025-05-07T20:11:14.2494372Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.2494491Z [CHECK] Listing out library size: 2025-05-07T20:11:14.2494736Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.2494740Z 2025-05-07T20:11:14.2497405Z 73 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.2498543Z 2025-05-07T20:11:14.2499392Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.2499874Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.2499880Z 2025-05-07T20:11:14.2902783Z GLIBC_2.2.5 
2025-05-07T20:11:14.2903035Z GLIBC_2.3 2025-05-07T20:11:14.2903266Z GLIBC_2.14 2025-05-07T20:11:14.2903676Z 2025-05-07T20:11:14.2903692Z 2025-05-07T20:11:14.2904885Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.2906315Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.2906332Z 2025-05-07T20:11:14.3303683Z GLIBCXX_3.4 2025-05-07T20:11:14.3303839Z GLIBCXX_3.4.9 2025-05-07T20:11:14.3303935Z GLIBCXX_3.4.11 2025-05-07T20:11:14.3304029Z GLIBCXX_3.4.14 2025-05-07T20:11:14.3304136Z GLIBCXX_3.4.15 2025-05-07T20:11:14.3304227Z GLIBCXX_3.4.18 2025-05-07T20:11:14.3304318Z GLIBCXX_3.4.19 2025-05-07T20:11:14.3304407Z GLIBCXX_3.4.20 2025-05-07T20:11:14.3304538Z GLIBCXX_3.4.21 2025-05-07T20:11:14.3304695Z 2025-05-07T20:11:14.3304852Z 2025-05-07T20:11:14.3326494Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.4iVHROkZMC.symbols.txt 2025-05-07T20:11:14.3326549Z 2025-05-07T20:11:14.3665631Z 2025-05-07T20:11:14.3694018Z [CHECK] Total Number of symbols: 6648 2025-05-07T20:11:14.3717176Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:11:14.3735564Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.ZfDZEAnrgz.usymbols.txt 2025-05-07T20:11:14.3735791Z 2025-05-07T20:11:14.3771163Z 2025-05-07T20:11:14.3798514Z [CHECK] Listing out undefined symbols (465 total): 2025-05-07T20:11:14.3815811Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.3831556Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.3833566Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:14.3834516Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:11:14.3835549Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.3836697Z U 
__cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:14.3837805Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.3838898Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:14.3839697Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:14.3840068Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:14.3840428Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:14.3840794Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:14.3841129Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:14.3841441Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:14.3841981Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:14.3842433Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:14.3842770Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:14.3843084Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:14.3843482Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:14.3843796Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:11:14.3844116Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:14.3844441Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:14.3844863Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:14.3845174Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:14.3845480Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:11:14.3845794Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:14.3846167Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:14.3846586Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:14.3846944Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:14.3847305Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:14.3847719Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:11:14.3848043Z U at::SplitUntil32Bit::end() const 2025-05-07T20:11:14.3848411Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:11:14.3848888Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:11:14.3849326Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:11:14.3849818Z U 
at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:14.3850257Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:11:14.3850693Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:11:14.3851075Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:11:14.3851443Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:11:14.3851806Z U at::TensorIteratorBase::numel() const 2025-05-07T20:11:14.3852155Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:11:14.3852614Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:11:14.3853125Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:11:14.3853549Z U at::TensorMaker::make_tensor() 2025-05-07T20:11:14.3853887Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:11:14.3854249Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.3854714Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.3855242Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.3855666Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:11:14.3856203Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:11:14.3856821Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.3857277Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:11:14.3857721Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:11:14.3858194Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.3858664Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:11:14.3859196Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:11:14.3859628Z U 
at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:14.3860213Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:11:14.3861137Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:11:14.3861784Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.3862650Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3863996Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3864942Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:11:14.3865402Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.3865876Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:11:14.3866653Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3867464Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.3868067Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:11:14.3868705Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:11:14.3869150Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:11:14.3869564Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.3869961Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:11:14.3870374Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.3871250Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3872112Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 
2025-05-07T20:11:14.3873044Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3873812Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.3874569Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:14.3875154Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:11:14.3875868Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.3877151Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:11:14.3877782Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:11:14.3878265Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:14.3878770Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:11:14.3879387Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:11:14.3879927Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:11:14.3880539Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:11:14.3881257Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:14.3882329Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:11:14.3883257Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:11:14.3883796Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:11:14.3884324Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:14.3884770Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:14.3885226Z U 
at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:14.3885721Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:11:14.3886427Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3887612Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3888585Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:11:14.3889100Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:11:14.3889485Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:14.3889881Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:11:14.3890271Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:11:14.3890896Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:11:14.3891465Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:14.3891859Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:14.3892236Z U at::get_num_threads() 2025-05-07T20:11:14.3892528Z U at::get_thread_num() 2025-05-07T20:11:14.3892949Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:11:14.3893387Z U at::internal::set_thread_num(int) 2025-05-07T20:11:14.3893854Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:11:14.3894795Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3896250Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:14.3897198Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:11:14.3897703Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:11:14.3898054Z U 
at::sequence_number::get_and_increment() 2025-05-07T20:11:14.3898439Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:14.3898815Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:14.3899138Z U bool at::Tensor::item() const 2025-05-07T20:11:14.3899468Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.3899842Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3900306Z U c10::AnyType::get() 2025-05-07T20:11:14.3900828Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:11:14.3901295Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.3901786Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3902211Z U c10::BoolType::get() 2025-05-07T20:11:14.3902526Z U c10::DeviceObjType::get() 2025-05-07T20:11:14.3902888Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:14.3903343Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:14.3903747Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:14.3904499Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:14.3905803Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:14.3906906Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.3907510Z U c10::Error::what() const 2025-05-07T20:11:14.3907807Z U c10::FloatType::get() 2025-05-07T20:11:14.3908119Z U c10::GradMode::is_enabled() 2025-05-07T20:11:14.3908449Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:14.3908823Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.3909279Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3909726Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:14.3910128Z U c10::IValue::hash(c10::IValue const&) 
2025-05-07T20:11:14.3910479Z U c10::IValue::isBoolList() const 2025-05-07T20:11:14.3910800Z U c10::IValue::isIntList() const 2025-05-07T20:11:14.3911142Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:14.3911466Z U c10::IValue::isTensorList() const 2025-05-07T20:11:14.3911835Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:14.3912193Z U c10::InferenceMode::is_enabled() 2025-05-07T20:11:14.3912516Z U c10::IntType::get() 2025-05-07T20:11:14.3913298Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.3914000Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:14.3914411Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:14.3914914Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.3915244Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:14.3915663Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.3916087Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:14.3916424Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:14.3916734Z U c10::ScalarTypeType::get() 2025-05-07T20:11:14.3917190Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:14.3917899Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:11:14.3918440Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.3918812Z U c10::StringType::get() 2025-05-07T20:11:14.3919162Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:14.3919546Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:14.3919949Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:14.3920571Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:14.3921171Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:14.3921519Z U c10::SymInt::operator%(c10::SymInt const&) 
const 2025-05-07T20:11:14.3921885Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:11:14.3922429Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:14.3922771Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:14.3923126Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:14.3923506Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:11:14.3923850Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:14.3924170Z U c10::SymIntType::get() 2025-05-07T20:11:14.3924709Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:14.3925104Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:14.3925769Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:11:14.3926480Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:14.3926864Z U c10::TensorType::get() 2025-05-07T20:11:14.3927849Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:11:14.3928951Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:11:14.3929360Z U c10::Type::is_module() const 2025-05-07T20:11:14.3929701Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:14.3930653Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:14.3931601Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:14.3932018Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:11:14.3932570Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:11:14.3933281Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:11:14.3933856Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:14.3934201Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:14.3934557Z U c10::cuda::GetDevice(signed char*) 
2025-05-07T20:11:14.3934889Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:14.3935235Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:14.3935707Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:14.3936166Z U c10::cuda::current_device() 2025-05-07T20:11:14.3936482Z U c10::cuda::device_count() 2025-05-07T20:11:14.3936873Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:14.3937273Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:14.3937662Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:14.3938223Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:14.3938635Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:14.3939023Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:14.3939682Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:14.3940845Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:14.3941729Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:14.3942598Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.3943586Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:14.3944617Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:14.3945584Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:14.3946252Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:11:14.3946836Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:11:14.3947272Z U 
c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:14.3947600Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:14.3948149Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:14.3948787Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:14.3949201Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:11:14.3949572Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:11:14.3949955Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:14.3950383Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:14.3950789Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:14.3951130Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.3951523Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:14.3952157Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:14.3952887Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:11:14.3953239Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:14.3953591Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:11:14.3953958Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:14.3954309Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:11:14.3954679Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:11:14.3955006Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:11:14.3955375Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:14.3955760Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:14.3956203Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:14.3956613Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:14.3956970Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:11:14.3957367Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:14.3957715Z U 
c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:14.3958040Z U c10::report_overflow(char const*) 2025-05-07T20:11:14.3958371Z U c10::throwNullDataPtrError() 2025-05-07T20:11:14.3958695Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:11:14.3959026Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:14.3959341Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:14.3959748Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:14.3960174Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:14.3960477Z U ceil@GLIBC_2.2.5 2025-05-07T20:11:14.3960777Z U cublasGemmStridedBatchedEx 2025-05-07T20:11:14.3961078Z U cublasSetStream_v2 2025-05-07T20:11:14.3961432Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:14.3961784Z U cudaDeviceGetByPCIBusId@libcudart.so.12 2025-05-07T20:11:14.3962139Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:14.3962493Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:14.3962842Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:14.3963189Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:14.3963523Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:14.3963846Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:14.3964181Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:14.3964523Z U cudaFree@libcudart.so.12 2025-05-07T20:11:14.3964841Z U cudaFuncGetAttributes@libcudart.so.12 2025-05-07T20:11:14.3965194Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:14.3965522Z U cudaGetDevice@libcudart.so.12 2025-05-07T20:11:14.3965858Z U cudaGetDeviceCount@libcudart.so.12 2025-05-07T20:11:14.3966218Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:14.3966573Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:14.3966910Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:14.3967253Z U cudaHostGetDevicePointer@libcudart.so.12 2025-05-07T20:11:14.3967607Z U cudaHostRegister@libcudart.so.12 2025-05-07T20:11:14.3967930Z U 
cudaHostUnregister@libcudart.so.12 2025-05-07T20:11:14.3968266Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:14.3968603Z U cudaMallocManaged@libcudart.so.12 2025-05-07T20:11:14.3968920Z U cudaMemAdvise@libcudart.so.12 2025-05-07T20:11:14.3969262Z U cudaMemPrefetchAsync@libcudart.so.12 2025-05-07T20:11:14.3969594Z U cudaMemcpy2DAsync@libcudart.so.12 2025-05-07T20:11:14.3969930Z U cudaMemcpyAsync@libcudart.so.12 2025-05-07T20:11:14.3970245Z U cudaMemsetAsync@libcudart.so.12 2025-05-07T20:11:14.3970741Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.12 2025-05-07T20:11:14.3971246Z U cudaPeekAtLastError@libcudart.so.12 2025-05-07T20:11:14.3971577Z U cudaSetDevice@libcudart.so.12 2025-05-07T20:11:14.3971898Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:14.3972231Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:14.3972578Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:14.3973004Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.3973419Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3973771Z U exit@GLIBC_2.2.5 2025-05-07T20:11:14.3974054Z U exp10@GLIBC_2.2.5 2025-05-07T20:11:14.3974361Z U exp2@GLIBC_2.2.5 2025-05-07T20:11:14.3974621Z U exp@GLIBC_2.2.5 2025-05-07T20:11:14.3974892Z U expf@GLIBC_2.2.5 2025-05-07T20:11:14.3975252Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:14.3975748Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:14.3976617Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:11:14.3977177Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:14.3977697Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:14.3978150Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.3978570Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3978925Z U fmod@GLIBC_2.2.5 
2025-05-07T20:11:14.3979289Z U free@GLIBC_2.2.5 2025-05-07T20:11:14.3979597Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:11:14.3979932Z U int at::Tensor::item() const 2025-05-07T20:11:14.3980422Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:14.3980819Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.3981280Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3981619Z U isnan@GLIBC_2.2.5 2025-05-07T20:11:14.3981913Z U lgamma@GLIBC_2.2.5 2025-05-07T20:11:14.3982206Z U llrint@GLIBC_2.2.5 2025-05-07T20:11:14.3982489Z U llround@GLIBC_2.2.5 2025-05-07T20:11:14.3982780Z U log10@GLIBC_2.2.5 2025-05-07T20:11:14.3983047Z U log2@GLIBC_2.2.5 2025-05-07T20:11:14.3983324Z U log@GLIBC_2.2.5 2025-05-07T20:11:14.3983590Z U logl@GLIBC_2.2.5 2025-05-07T20:11:14.3983888Z U long at::Tensor::item() const 2025-05-07T20:11:14.3984275Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:14.3984731Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:11:14.3985155Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.3985532Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3985889Z U lrint@GLIBC_2.2.5 2025-05-07T20:11:14.3986163Z U madvise@GLIBC_2.2.5 2025-05-07T20:11:14.3986447Z U malloc@GLIBC_2.2.5 2025-05-07T20:11:14.3986725Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:14.3987005Z U memcpy@GLIBC_2.14 2025-05-07T20:11:14.3987287Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:14.3987567Z U memset@GLIBC_2.2.5 2025-05-07T20:11:14.3987853Z U nextafter@GLIBC_2.2.5 2025-05-07T20:11:14.3988156Z U nvmlDeviceGetCount_v2 2025-05-07T20:11:14.3988474Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:11:14.3988816Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:11:14.3989165Z U nvmlDeviceGetNvLinkState 2025-05-07T20:11:14.3989469Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:11:14.3989774Z U nvmlInit_v2 2025-05-07T20:11:14.3990048Z U operator delete(void*)@GLIBCXX_3.4 
2025-05-07T20:11:14.3990399Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.3990757Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:14.3991196Z U pow@GLIBC_2.2.5 2025-05-07T20:11:14.3991500Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:14.3991838Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3994259Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.3994624Z U sin@GLIBC_2.2.5 2025-05-07T20:11:14.3994995Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:14.3995462Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:14.3995893Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:11:14.3996333Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:11:14.3996942Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:11:14.3997723Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:14.3998506Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.3999305Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:14.4000063Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:14.4000874Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:14.4001426Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:11:14.4001742Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:11:14.4002056Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:14.4002370Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:14.4002686Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:11:14.4003010Z U 
std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:14.4003376Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.4003736Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.4004099Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:11:14.4004495Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:14.4004877Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:14.4005251Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:14.4005670Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:11:14.4006291Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:11:14.4006942Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:11:14.4007487Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:14.4008133Z U std::basic_ofstream >::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:11:14.4008761Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:11:14.4009622Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.4010468Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:11:14.4010801Z U std::cout@GLIBCXX_3.4 2025-05-07T20:11:14.4011133Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:11:14.4011544Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:14.4011872Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:14.4012197Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:14.4012516Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:14.4012839Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:14.4013218Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:11:14.4013682Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 
2025-05-07T20:11:14.4014191Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:14.4014627Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:11:14.4014946Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:14.4015274Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:11:14.4015642Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:11:14.4016033Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.4016421Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:14.4016811Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:14.4017479Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:14.4018250Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:14.4018588Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:14.4018880Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:14.4019139Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:14.4019418Z U sysconf@GLIBC_2.2.5 2025-05-07T20:11:14.4019702Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:14.4020746Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:14.4021912Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.4022967Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:11:14.4023841Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:14.4024333Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:14.4024860Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:14.4025456Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:14.4025948Z U 
torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:14.4026455Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:14.4027103Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:14.4027709Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:14.4028159Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:14.4028664Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:14.4029135Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:14.4029466Z U torch::autograd::Node::metadata() 2025-05-07T20:11:14.4029826Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:14.4030348Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:14.4030972Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:14.4031500Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:14.4031948Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:14.4032494Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:14.4035407Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:14.4038122Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:14.4038509Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:14.4038899Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:14.4039299Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:14.4039929Z U 
torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:14.4040744Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.4041545Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:14.4042199Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:11:14.4042596Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:11:14.4043324Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:14.4044020Z U typeinfo for c10::Error 2025-05-07T20:11:14.4044313Z U typeinfo for c10::Type 2025-05-07T20:11:14.4044639Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.4044984Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:14.4045317Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:14.4045651Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:14.4045986Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:14.4046368Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:11:14.4046835Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:14.4047570Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.4048623Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.4049642Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.4050692Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:11:14.4051689Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 
2025-05-07T20:11:14.4052694Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:11:14.4053733Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:14.4054785Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:14.4056007Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:11:14.4057152Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:14.4058339Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:11:14.4059141Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:14.4059552Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:14.4059959Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:14.4060642Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.4061079Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:11:14.4061488Z U vtable for at::TensorIterator 2025-05-07T20:11:14.4061847Z U vtable for at::TensorIteratorBase 2025-05-07T20:11:14.4062176Z U vtable for c10::Error 2025-05-07T20:11:14.4062499Z U vtable for c10::ListType 2025-05-07T20:11:14.4063038Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:14.4063634Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:14.4064121Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:14.4064602Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:11:14.4064977Z U vtable for torch::autograd::Node 
2025-05-07T20:11:14.4065382Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:14.4065800Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:14.4066107Z w _ITM_registerTMCloneTable 2025-05-07T20:11:14.4066409Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:14.4066711Z w __gmon_start__ 2025-05-07T20:11:14.4066967Z w __pthread_key_create 2025-05-07T20:11:14.4067267Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:14.4067578Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:14.4067878Z w pthread_once 2025-05-07T20:11:14.4068178Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:14.4068594Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.4068870Z 2025-05-07T20:11:14.4069047Z linux-vdso.so.1 (0x00007ffe501e3000) 2025-05-07T20:11:14.4069375Z libc10.so => not found 2025-05-07T20:11:14.4069622Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4069872Z libc10_cuda.so => not found 2025-05-07T20:11:14.4070163Z libnccl.so.2 => not found 2025-05-07T20:11:14.4070403Z libcuda.so.1 => not found 2025-05-07T20:11:14.4070929Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f03f2c00000) 2025-05-07T20:11:14.4071943Z fbgemm_gpu_embedding_inplace_ops.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so (0x00007f03f80b3000) 2025-05-07T20:11:14.4073201Z fbgemm_gpu_tbe_index_select.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so (0x00007f03f0800000) 2025-05-07T20:11:14.4074188Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f03ef000000) 2025-05-07T20:11:14.4075170Z fbgemm_gpu_tbe_optimizers.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so (0x00007f03ee600000) 2025-05-07T20:11:14.4075821Z 
libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4076406Z libtorch.so => not found 2025-05-07T20:11:14.4077103Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f03ee459000) 2025-05-07T20:11:14.4078231Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f03ed200000) 2025-05-07T20:11:14.4078893Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4079200Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4079472Z libcudart.so.12 => not found 2025-05-07T20:11:14.4079854Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f03ecf9c000) 2025-05-07T20:11:14.4080280Z libm.so.6 => /lib64/libm.so.6 (0x00007f03f7fd4000) 2025-05-07T20:11:14.4080719Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f03f7fa6000) 2025-05-07T20:11:14.4081153Z libc.so.6 => /lib64/libc.so.6 (0x00007f03ecd94000) 2025-05-07T20:11:14.4081537Z /lib64/ld-linux-x86-64.so.2 (0x00007f03f812c000) 2025-05-07T20:11:14.4081886Z libc10.so => not found 2025-05-07T20:11:14.4082140Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4082422Z libc10_cuda.so => not found 2025-05-07T20:11:14.4082683Z libnccl.so.2 => not found 2025-05-07T20:11:14.4082954Z libcuda.so.1 => not found 2025-05-07T20:11:14.4083470Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f03f3189000) 2025-05-07T20:11:14.4084051Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4084335Z libtorch.so => not found 2025-05-07T20:11:14.4084588Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4084874Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4085139Z libtorch.so => not found 2025-05-07T20:11:14.4085230Z libc10.so => not found 2025-05-07T20:11:14.4085348Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4085443Z libc10_cuda.so => not found 2025-05-07T20:11:14.4085538Z libnccl.so.2 => not found 2025-05-07T20:11:14.4085661Z 
libcuda.so.1 => not found 2025-05-07T20:11:14.4085774Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4085875Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4085980Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4086109Z libcudart.so.12 => not found 2025-05-07T20:11:14.4086274Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f03f2baa000) 2025-05-07T20:11:14.4086376Z libc10.so => not found 2025-05-07T20:11:14.4086508Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4086610Z libc10_cuda.so => not found 2025-05-07T20:11:14.4086715Z libnccl.so.2 => not found 2025-05-07T20:11:14.4086818Z libcuda.so.1 => not found 2025-05-07T20:11:14.4086947Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4087051Z libtorch.so => not found 2025-05-07T20:11:14.4087159Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4087413Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4087523Z libcudart.so.12 => not found 2025-05-07T20:11:14.4087627Z libtorch.so => not found 2025-05-07T20:11:14.4087726Z libc10.so => not found 2025-05-07T20:11:14.4087860Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4087999Z libc10_cuda.so => not found 2025-05-07T20:11:14.4088109Z libnccl.so.2 => not found 2025-05-07T20:11:14.4088239Z libcuda.so.1 => not found 2025-05-07T20:11:14.4088350Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4088459Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4088565Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4088801Z libcudart.so.12 => not found 2025-05-07T20:11:14.4088898Z libtorch.so => not found 2025-05-07T20:11:14.4088989Z libc10.so => not found 2025-05-07T20:11:14.4089112Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4089209Z libc10_cuda.so => not found 2025-05-07T20:11:14.4089305Z libnccl.so.2 => not found 2025-05-07T20:11:14.4089405Z libcuda.so.1 => not found 2025-05-07T20:11:14.4089537Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4089637Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4089739Z libtorch_cuda.so => not found 
2025-05-07T20:11:14.4089863Z libcudart.so.12 => not found 2025-05-07T20:11:14.4089990Z libc10.so => not found 2025-05-07T20:11:14.4090092Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4090189Z libc10_cuda.so => not found 2025-05-07T20:11:14.4090310Z libnccl.so.2 => not found 2025-05-07T20:11:14.4090409Z libcuda.so.1 => not found 2025-05-07T20:11:14.4090514Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4090634Z libtorch.so => not found 2025-05-07T20:11:14.4090734Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4090836Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4090934Z libcudart.so.12 => not found 2025-05-07T20:11:14.4091054Z libtorch.so => not found 2025-05-07T20:11:14.4091144Z libc10.so => not found 2025-05-07T20:11:14.4091244Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4091367Z libc10_cuda.so => not found 2025-05-07T20:11:14.4091467Z libnccl.so.2 => not found 2025-05-07T20:11:14.4091563Z libcuda.so.1 => not found 2025-05-07T20:11:14.4091668Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4091789Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4091892Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4091991Z libcudart.so.12 => not found 2025-05-07T20:11:14.4092113Z libtorch.so => not found 2025-05-07T20:11:14.4092207Z libc10.so => not found 2025-05-07T20:11:14.4092309Z libnvrtc.so.12 => not found 2025-05-07T20:11:14.4092407Z libc10_cuda.so => not found 2025-05-07T20:11:14.4092527Z libnccl.so.2 => not found 2025-05-07T20:11:14.4092625Z libcuda.so.1 => not found 2025-05-07T20:11:14.4092726Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:14.4092851Z libtorch_cpu.so => not found 2025-05-07T20:11:14.4092951Z libtorch_cuda.so => not found 2025-05-07T20:11:14.4093093Z librt.so.1 => /lib64/librt.so.1 (0x00007f03f7f83000) 2025-05-07T20:11:14.4093268Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f03f7f7e000) 2025-05-07T20:11:14.4093301Z 2025-05-07T20:11:14.4093412Z [CHECK] Displaying ELF information: 2025-05-07T20:11:14.4093612Z + 
readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:11:14.4093616Z 2025-05-07T20:11:14.4093624Z 2025-05-07T20:11:14.4093786Z Dynamic section at offset 0x48e4fa8 contains 47 entries: 2025-05-07T20:11:14.4093927Z Tag Type Name/Value 2025-05-07T20:11:14.4094119Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:14.4094320Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:14.4094541Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:14.4094734Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:14.4094924Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:14.4095130Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:14.4095450Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:11:14.4095679Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:11:14.4095942Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:14.4096165Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:11:14.4096370Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:14.4096586Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:14.4096825Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:14.4097036Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:14.4097233Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:14.4097460Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:14.4097662Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:14.4097879Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:14.4098087Z 
0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:14.4098278Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:14.4098464Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:14.4098696Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:14.4098899Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:11:14.4099077Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:14.4099219Z 0x000000000000000c (INIT) 0x1bb000 2025-05-07T20:11:14.4099340Z 0x000000000000000d (FINI) 0x75816c 2025-05-07T20:11:14.4099466Z 0x0000000000000019 (INIT_ARRAY) 0x48d6858 2025-05-07T20:11:14.4099601Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:11:14.4099756Z 0x000000000000001a (FINI_ARRAY) 0x48d6ce0 2025-05-07T20:11:14.4099879Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:14.4099995Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:14.4100237Z 0x0000000000000005 (STRTAB) 0x33248 2025-05-07T20:11:14.4100356Z 0x0000000000000006 (SYMTAB) 0xc2f0 2025-05-07T20:11:14.4100672Z 0x000000000000000a (STRSZ) 1276767 (bytes) 2025-05-07T20:11:14.4100813Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:14.4100974Z 0x0000000000000003 (PLTGOT) 0x48eafe8 2025-05-07T20:11:14.4101219Z 0x0000000000000002 (PLTRELSZ) 68808 (bytes) 2025-05-07T20:11:14.4101339Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:14.4101495Z 0x0000000000000017 (JMPREL) 0x1a9648 2025-05-07T20:11:14.4101617Z 0x0000000000000007 (RELA) 0x16e320 2025-05-07T20:11:14.4101764Z 0x0000000000000008 (RELASZ) 242472 (bytes) 2025-05-07T20:11:14.4101924Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:14.4102056Z 0x000000006ffffffe (VERNEED) 0x16e1a0 2025-05-07T20:11:14.4102174Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:14.4102302Z 0x000000006ffffff0 (VERSYM) 0x16ada8 2025-05-07T20:11:14.4102448Z 0x000000006ffffff9 (RELACOUNT) 2870 
2025-05-07T20:11:14.4102559Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:14.4102564Z 2025-05-07T20:11:14.4102695Z ################################################################################ 2025-05-07T20:11:14.4102700Z 2025-05-07T20:11:14.4102704Z 2025-05-07T20:11:14.4102853Z ################################################################################ 2025-05-07T20:11:14.4103240Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.4103361Z [CHECK] Listing out library size: 2025-05-07T20:11:14.4103693Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.4103723Z 2025-05-07T20:11:14.4103966Z 904 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.4103991Z 2025-05-07T20:11:14.4104435Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.4105005Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.4105010Z 2025-05-07T20:11:14.5976784Z GLIBC_2.2.5 2025-05-07T20:11:14.5976994Z GLIBC_2.3 2025-05-07T20:11:14.5977141Z GLIBC_2.14 2025-05-07T20:11:14.5977149Z 2025-05-07T20:11:14.5977168Z 2025-05-07T20:11:14.5977684Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:14.5978252Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:14.5978434Z 2025-05-07T20:11:14.7980588Z GLIBCXX_3.4 2025-05-07T20:11:14.7980812Z GLIBCXX_3.4.9 2025-05-07T20:11:14.7980946Z GLIBCXX_3.4.11 2025-05-07T20:11:14.7981104Z GLIBCXX_3.4.14 
2025-05-07T20:11:14.7981660Z GLIBCXX_3.4.15 2025-05-07T20:11:14.7981785Z GLIBCXX_3.4.18 2025-05-07T20:11:14.7981894Z GLIBCXX_3.4.20 2025-05-07T20:11:14.7981985Z GLIBCXX_3.4.21 2025-05-07T20:11:14.7981991Z 2025-05-07T20:11:14.7981995Z 2025-05-07T20:11:14.8003031Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.Pmgy8GtHJW.symbols.txt 2025-05-07T20:11:14.8003062Z 2025-05-07T20:11:14.9970801Z 2025-05-07T20:11:15.0102228Z [CHECK] Total Number of symbols: 12682 2025-05-07T20:11:15.0234326Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:11:15.0256146Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.UciHoVFGZq.usymbols.txt 2025-05-07T20:11:15.0256726Z 2025-05-07T20:11:15.0337148Z 2025-05-07T20:11:15.0366425Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:11:15.0385163Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.0387546Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.0389151Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.0390192Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.0391052Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.0391589Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.0391986Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.0392404Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.0392799Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.0393173Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.0393578Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:15.0393921Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.0394274Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.0394607Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.0394964Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:15.0395312Z U __cxa_guard_abort@CXXABI_1.3 
2025-05-07T20:11:15.0395677Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:15.0396035Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:15.0396607Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.0396976Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.0397309Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:15.0397714Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.0398047Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.0398420Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:15.0398866Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:15.0399303Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.0399761Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:15.0400291Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:15.0400666Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:15.0401039Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:15.0401502Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.0402255Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:11:15.0402862Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.0403687Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.0404949Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.0405842Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:15.0407204Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.0408354Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 
2025-05-07T20:11:15.0408922Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:11:15.0409363Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:15.0410134Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.0411276Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.0412132Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:15.0412558Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.0412937Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:15.0413354Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:15.0413728Z U at::get_thread_num() 2025-05-07T20:11:15.0414060Z U at::globalContext() 2025-05-07T20:11:15.0414378Z U at::internal::set_thread_num(int) 2025-05-07T20:11:15.0414746Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:15.0415190Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:11:15.0415614Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:11:15.0415980Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:15.0416336Z U c10::AnyType::get() 2025-05-07T20:11:15.0416935Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.0417368Z U c10::BoolType::get() 2025-05-07T20:11:15.0417799Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.0418284Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:15.0418708Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:15.0419488Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:15.0420868Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:11:15.0422006Z U 
c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.0422634Z U c10::Error::what() const 2025-05-07T20:11:15.0422958Z U c10::FloatType::get() 2025-05-07T20:11:15.0423349Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:15.0423719Z U c10::GradMode::is_enabled() 2025-05-07T20:11:15.0424056Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:15.0424457Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.0424910Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.0425397Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:15.0425800Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:15.0426168Z U c10::IValue::isBoolList() const 2025-05-07T20:11:15.0426529Z U c10::IValue::isIntList() const 2025-05-07T20:11:15.0426875Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:15.0427238Z U c10::IValue::isTensorList() const 2025-05-07T20:11:15.0427606Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.0427994Z U c10::IntType::get() 2025-05-07T20:11:15.0428370Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.0428800Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.0429186Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.0429552Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.0430037Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.0430518Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:11:15.0430917Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:11:15.0431464Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:15.0432030Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.0432441Z U c10::StringType::get() 2025-05-07T20:11:15.0432931Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.0433367Z U c10::SymFloat::guard_float(char const*, long) 
const 2025-05-07T20:11:15.0434051Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.0434701Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.0435096Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.0435443Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:15.0435791Z U c10::SymIntType::get() 2025-05-07T20:11:15.0436213Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.0436639Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:15.0437227Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.0437642Z U c10::TensorType::get() 2025-05-07T20:11:15.0438007Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.0438971Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.0439973Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.0440373Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.0440754Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.0441103Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.0441465Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.0441825Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.0442297Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.0442828Z U c10::cuda::device_count() 2025-05-07T20:11:15.0443174Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.0443588Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:15.0443983Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:15.0444400Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.0444829Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.0445218Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.0445893Z U c10::detail::ListImpl::ListImpl(std::vector >, 
c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.0446968Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.0447852Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.0448734Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.0449698Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.0450730Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.0451561Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.0451901Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.0452476Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:15.0453107Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:15.0453555Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.0453999Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.0454397Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.0454814Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.0455203Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:15.0455836Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.0456512Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:15.0456891Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.0457294Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:15.0457769Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:15.0458183Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:15.0458574Z U 
c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:11:15.0458926Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:11:15.0459296Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:11:15.0459640Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.0459981Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.0460439Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.0460862Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.0461309Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.0461662Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.0462079Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.0462447Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.0462820Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.0463186Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.0463530Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.0463885Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.0464241Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.0464615Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.0464987Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.0465379Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.0465739Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.0466078Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.0466431Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.0466783Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.0467154Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.0468155Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:11:15.0469390Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:11:15.0469974Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:11:15.0470402Z U 
fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:15.0470852Z U float at::Tensor::item() const 2025-05-07T20:11:15.0471218Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.0471648Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.0472030Z U free@GLIBC_2.2.5 2025-05-07T20:11:15.0472345Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.0472851Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.0473273Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:15.0473700Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.0474103Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.0474457Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:15.0474753Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.0475092Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.0475395Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.0475690Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:15.0476195Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.0477008Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.0477766Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.0478516Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.0479269Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.0480044Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.0480809Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.0481344Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:15.0482070Z U 
split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:11:15.0483039Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:11:15.0483676Z U sqrt@GLIBC_2.2.5 2025-05-07T20:11:15.0483978Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:11:15.0484387Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:15.0485073Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.0485917Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.0486768Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:15.0487599Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.0488199Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.0488553Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.0489040Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:11:15.0489422Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.0489819Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.0490235Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.0490658Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.0491033Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:15.0491534Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.0492459Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.0493253Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:15.0493623Z U std::ios_base::Init::Init()@GLIBCXX_3.4 
2025-05-07T20:11:15.0493981Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.0494327Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.0512770Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.0513350Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.0513866Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.0514395Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.0514798Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:15.0515200Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:15.0515869Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:11:15.0516533Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:15.0516901Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.0517220Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:15.0517507Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.0517833Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.0518635Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.0519819Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.0520637Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.0521123Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:15.0521647Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:15.0522222Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:15.0522907Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:15.0523424Z U torch::autograd::AutogradContext::get_saved_variables() const 
2025-05-07T20:11:15.0524074Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:15.0524704Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:15.0525156Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:15.0525665Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:15.0526092Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:15.0526439Z U torch::autograd::Node::metadata() 2025-05-07T20:11:15.0526812Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:15.0527302Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:15.0527943Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:11:15.0528487Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:15.0528952Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:15.0529509Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:15.0532601Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:15.0535549Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:15.0535978Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:15.0536405Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:15.0536851Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:15.0537523Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:15.0538428Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 
2025-05-07T20:11:15.0539464Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.0541945Z U typeinfo for c10::Error 2025-05-07T20:11:15.0542314Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.0542712Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:15.0543082Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:15.0543469Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:15.0543825Z U typeinfo for torch::autograd::Node 2025-05-07T20:11:15.0545124Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:15.0547362Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:11:15.0548704Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:15.0549128Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.0549571Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:15.0549993Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.0550379Z U vtable for c10::Error 2025-05-07T20:11:15.0550931Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.0551504Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.0551994Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:15.0552446Z U vtable for torch::autograd::Node 2025-05-07T20:11:15.0552855Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.0553254Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.0553590Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.0553917Z w __cxa_finalize@GLIBC_2.2.5 
2025-05-07T20:11:15.0554218Z w __gmon_start__ 2025-05-07T20:11:15.0554508Z w __pthread_key_create 2025-05-07T20:11:15.0554814Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.0555235Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.0555600Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.0556105Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:15.0556488Z 2025-05-07T20:11:15.0556673Z linux-vdso.so.1 (0x00007fff991c5000) 2025-05-07T20:11:15.0556987Z libc10.so => not found 2025-05-07T20:11:15.0557238Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0557519Z libc10_cuda.so => not found 2025-05-07T20:11:15.0557780Z libnccl.so.2 => not found 2025-05-07T20:11:15.0558046Z libcuda.so.1 => not found 2025-05-07T20:11:15.0558667Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f0df4c00000) 2025-05-07T20:11:15.0559707Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f0df4800000) 2025-05-07T20:11:15.0560830Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f0df4659000) 2025-05-07T20:11:15.0561577Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0561862Z libtorch.so => not found 2025-05-07T20:11:15.0562407Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f0df4000000) 2025-05-07T20:11:15.0563335Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f0df2e00000) 2025-05-07T20:11:15.0564010Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0564279Z libtorch_cuda.so => not found 2025-05-07T20:11:15.0564562Z libcudart.so.12 => not found 2025-05-07T20:11:15.0564894Z libstdc++.so.6 => /lib64/libstdc++.so.6 
(0x00007f0df2b9c000) 2025-05-07T20:11:15.0565301Z libm.so.6 => /lib64/libm.so.6 (0x00007f0e30188000) 2025-05-07T20:11:15.0565680Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f0e3015a000) 2025-05-07T20:11:15.0566075Z libc.so.6 => /lib64/libc.so.6 (0x00007f0df2994000) 2025-05-07T20:11:15.0566432Z /lib64/ld-linux-x86-64.so.2 (0x00007f0e3026d000) 2025-05-07T20:11:15.0566770Z libtorch.so => not found 2025-05-07T20:11:15.0567031Z libc10.so => not found 2025-05-07T20:11:15.0567284Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0567556Z libc10_cuda.so => not found 2025-05-07T20:11:15.0567816Z libnccl.so.2 => not found 2025-05-07T20:11:15.0568086Z libcuda.so.1 => not found 2025-05-07T20:11:15.0568341Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0568625Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0568895Z libtorch_cuda.so => not found 2025-05-07T20:11:15.0569177Z libcudart.so.12 => not found 2025-05-07T20:11:15.0569498Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f0e30100000) 2025-05-07T20:11:15.0569861Z libc10.so => not found 2025-05-07T20:11:15.0570118Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0570376Z libc10_cuda.so => not found 2025-05-07T20:11:15.0570647Z libnccl.so.2 => not found 2025-05-07T20:11:15.0570902Z libcuda.so.1 => not found 2025-05-07T20:11:15.0571518Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f0df63f3000) 2025-05-07T20:11:15.0572173Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0572457Z libtorch.so => not found 2025-05-07T20:11:15.0572706Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0573105Z libtorch_cuda.so => not found 2025-05-07T20:11:15.0573381Z libcudart.so.12 => not found 2025-05-07T20:11:15.0573633Z libc10.so => not found 2025-05-07T20:11:15.0573884Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0574137Z libc10_cuda.so => not found 2025-05-07T20:11:15.0574396Z libnccl.so.2 => not found 2025-05-07T20:11:15.0574640Z libcuda.so.1 => not 
found 2025-05-07T20:11:15.0574905Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0575166Z libtorch.so => not found 2025-05-07T20:11:15.0575424Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0575681Z libtorch_cuda.so => not found 2025-05-07T20:11:15.0576403Z libcudart.so.12 => not found 2025-05-07T20:11:15.0576683Z libc10.so => not found 2025-05-07T20:11:15.0576939Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0577268Z libc10_cuda.so => not found 2025-05-07T20:11:15.0577609Z libnccl.so.2 => not found 2025-05-07T20:11:15.0577872Z libcuda.so.1 => not found 2025-05-07T20:11:15.0578387Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f0df6374000) 2025-05-07T20:11:15.0578962Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0579230Z libtorch.so => not found 2025-05-07T20:11:15.0579498Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0579763Z libtorch_cuda.so => not found 2025-05-07T20:11:15.0580036Z libtorch.so => not found 2025-05-07T20:11:15.0580394Z libc10.so => not found 2025-05-07T20:11:15.0580633Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0580908Z libc10_cuda.so => not found 2025-05-07T20:11:15.0581168Z libnccl.so.2 => not found 2025-05-07T20:11:15.0581436Z libcuda.so.1 => not found 2025-05-07T20:11:15.0581691Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0581978Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0582245Z libtorch_cuda.so => not found 2025-05-07T20:11:15.0582582Z libcudart.so.12 => not found 2025-05-07T20:11:15.0582841Z libtorch.so => not found 2025-05-07T20:11:15.0583100Z libc10.so => not found 2025-05-07T20:11:15.0583358Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0583617Z libc10_cuda.so => not found 2025-05-07T20:11:15.0583889Z libnccl.so.2 => not found 2025-05-07T20:11:15.0584138Z libcuda.so.1 => not found 2025-05-07T20:11:15.0584405Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0584671Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0584951Z 
libtorch_cuda.so => not found 2025-05-07T20:11:15.0585299Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f0df6367000) 2025-05-07T20:11:15.0585694Z libtorch.so => not found 2025-05-07T20:11:15.0585941Z libc10.so => not found 2025-05-07T20:11:15.0586197Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.0586474Z libc10_cuda.so => not found 2025-05-07T20:11:15.0586730Z libnccl.so.2 => not found 2025-05-07T20:11:15.0586990Z libcuda.so.1 => not found 2025-05-07T20:11:15.0587242Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.0587525Z libtorch_cpu.so => not found 2025-05-07T20:11:15.0587786Z libtorch_cuda.so => not found 2025-05-07T20:11:15.0588104Z librt.so.1 => /lib64/librt.so.1 (0x00007f0df635e000) 2025-05-07T20:11:15.0588346Z 2025-05-07T20:11:15.0588471Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.0588930Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:11:15.0589308Z 2025-05-07T20:11:15.0589312Z 2025-05-07T20:11:15.0589495Z Dynamic section at offset 0x38775ba0 contains 45 entries: 2025-05-07T20:11:15.0589876Z Tag Type Name/Value 2025-05-07T20:11:15.0590302Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:15.0590815Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:15.0591338Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.0591855Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:15.0592481Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:15.0593002Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:11:15.0593539Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:11:15.0594116Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:11:15.0594686Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:15.0595184Z 
0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.0595732Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:11:15.0596285Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:11:15.0596822Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.0597365Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.0597889Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.0598405Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.0598894Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:11:15.0599394Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.0599877Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.0600394Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.0600966Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.0601515Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.0601922Z 0x000000000000000c (INIT) 0x652000 2025-05-07T20:11:15.0602299Z 0x000000000000000d (FINI) 0x2f6443c 2025-05-07T20:11:15.0602803Z 0x0000000000000019 (INIT_ARRAY) 0x3871d880 2025-05-07T20:11:15.0603151Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:11:15.0603514Z 0x000000000000001a (FINI_ARRAY) 0x3871dfa0 2025-05-07T20:11:15.0603852Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.0604198Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:15.0604533Z 0x0000000000000005 (STRTAB) 0x62978 2025-05-07T20:11:15.0604850Z 0x0000000000000006 (SYMTAB) 0x18470 2025-05-07T20:11:15.0605206Z 0x000000000000000a (STRSZ) 5120077 (bytes) 2025-05-07T20:11:15.0605558Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.0605907Z 0x0000000000000003 (PLTGOT) 0x38788fe8 
2025-05-07T20:11:15.0606255Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:11:15.0606601Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.0606919Z 0x0000000000000017 (JMPREL) 0x641978 2025-05-07T20:11:15.0607255Z 0x0000000000000007 (RELA) 0x54ae50 2025-05-07T20:11:15.0607613Z 0x0000000000000008 (RELASZ) 1010472 (bytes) 2025-05-07T20:11:15.0607964Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.0608310Z 0x000000006ffffffe (VERNEED) 0x54ace0 2025-05-07T20:11:15.0608632Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:11:15.0608960Z 0x000000006ffffff0 (VERSYM) 0x5449c6 2025-05-07T20:11:15.0609285Z 0x000000006ffffff9 (RELACOUNT) 28262 2025-05-07T20:11:15.0609606Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.0609801Z 2025-05-07T20:11:15.0609931Z ################################################################################ 2025-05-07T20:11:15.0610155Z 2025-05-07T20:11:15.0610159Z 2025-05-07T20:11:15.0610272Z ################################################################################ 2025-05-07T20:11:15.0610820Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.0611336Z [CHECK] Listing out library size: 2025-05-07T20:11:15.0611832Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.0612240Z 2025-05-07T20:11:15.0612490Z 142 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.0612839Z 2025-05-07T20:11:15.0613262Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.0614377Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.0614996Z 2025-05-07T20:11:15.0829777Z GLIBC_2.2.5 2025-05-07T20:11:15.0830048Z 
GLIBC_2.3 2025-05-07T20:11:15.0830258Z GLIBC_2.14 2025-05-07T20:11:15.0834332Z 2025-05-07T20:11:15.0834350Z 2025-05-07T20:11:15.0834839Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.0835978Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.0836636Z 2025-05-07T20:11:15.1116977Z GLIBCXX_3.4 2025-05-07T20:11:15.1117319Z GLIBCXX_3.4.9 2025-05-07T20:11:15.1117914Z GLIBCXX_3.4.11 2025-05-07T20:11:15.1118181Z GLIBCXX_3.4.18 2025-05-07T20:11:15.1118414Z GLIBCXX_3.4.20 2025-05-07T20:11:15.1118619Z GLIBCXX_3.4.21 2025-05-07T20:11:15.1118761Z 2025-05-07T20:11:15.1118766Z 2025-05-07T20:11:15.1138716Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.ibsaszK5ep.symbols.txt 2025-05-07T20:11:15.1139262Z 2025-05-07T20:11:15.1378519Z 2025-05-07T20:11:15.1416086Z [CHECK] Total Number of symbols: 1629 2025-05-07T20:11:15.1440622Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:11:15.1458005Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.ZzhoJTCJkB.usymbols.txt 2025-05-07T20:11:15.1458604Z 2025-05-07T20:11:15.1481986Z 2025-05-07T20:11:15.1507084Z [CHECK] Listing out undefined symbols (171 total): 2025-05-07T20:11:15.1527402Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.1529889Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.1531463Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.1532196Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.1532581Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.1533065Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.1533431Z U 
__cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.1533770Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.1534108Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.1534436Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.1534765Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.1535047Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.1535348Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.1535631Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.1535927Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.1536245Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.1536537Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.1536850Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:15.1537215Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:15.1537620Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.1538020Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.1538454Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.1539257Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.1540945Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.1541926Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:15.1542721Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:15.1543661Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.1544830Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.1545691Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:15.1546096Z U 
at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.1546451Z U at::globalContext() 2025-05-07T20:11:15.1546952Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.1547339Z U c10::BoolType::get() 2025-05-07T20:11:15.1547724Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.1548063Z U c10::FloatType::get() 2025-05-07T20:11:15.1548365Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:15.1548735Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.1549148Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.1549490Z U c10::IntType::get() 2025-05-07T20:11:15.1549826Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.1550207Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.1550561Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.1550958Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.1551338Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:15.1551952Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.1552564Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.1552899Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.1553210Z U c10::SymIntType::get() 2025-05-07T20:11:15.1553535Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.1553930Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.1554278Z U c10::TensorType::get() 2025-05-07T20:11:15.1554576Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.1555465Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.1556364Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.1556697Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.1557028Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.1557344Z U 
c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.1557473Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.1557582Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.1557818Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.1557931Z U c10::cuda::device_count() 2025-05-07T20:11:15.1558088Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.1558240Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:15.1558387Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:15.1558517Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.1558693Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.1558799Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.1559294Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.1559530Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.1560002Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.1560322Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.1560861Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.1561018Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.1561120Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.1561259Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.1561427Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.1561541Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.1561674Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.1561814Z U c10::operator<<(std::ostream&, 
c10::DeviceType) 2025-05-07T20:11:15.1561927Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.1562026Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.1562144Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.1562330Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.1562443Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.1562566Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.1562696Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.1562820Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.1562930Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.1563060Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.1563165Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.1563274Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.1563408Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.1563524Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.1563655Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.1563773Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.1563897Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.1564007Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.1564116Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.1564248Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.1564361Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.1566472Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:15.1566706Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:15.1566819Z U float at::Tensor::item() const 2025-05-07T20:11:15.1566964Z U float* at::TensorBase::data_ptr() 
const 2025-05-07T20:11:15.1567113Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.1567233Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.1567380Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.1567551Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:15.1567677Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.1567831Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.1567950Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.1568054Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.1568147Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.1568257Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:15.1568373Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.1568692Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.1568979Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.1569290Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.1569599Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.1569917Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.1570299Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.1570603Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:15.1570953Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.1571080Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.1571188Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.1571324Z U std::__throw_length_error(char 
const*)@GLIBCXX_3.4 2025-05-07T20:11:15.1571468Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.1571629Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.1571758Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.1571996Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.1572535Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.1572651Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:15.1572780Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.1572890Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.1573047Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.1573235Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.1573456Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.1573602Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.1573721Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.1573811Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.1573925Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.1574483Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.1574913Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.1575153Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.1575505Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:15.1576191Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.1577797Z U void 
embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.1579227Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.1580701Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.1582082Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.1583441Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.1584895Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.1587034Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.1589241Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.1591147Z U void 
embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.1593108Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.1595007Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.1596898Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.1598628Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:15.1598785Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:15.1599016Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.1599176Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.1599511Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.1599784Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:15.1599899Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.1600006Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.1600126Z w 
__cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:15.1600216Z w __gmon_start__ 2025-05-07T20:11:15.1600314Z w __pthread_key_create 2025-05-07T20:11:15.1600426Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.1600549Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.1600698Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.1600954Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.1600962Z 2025-05-07T20:11:15.1601120Z linux-vdso.so.1 (0x00007ffec11c6000) 2025-05-07T20:11:15.1601242Z libc10.so => not found 2025-05-07T20:11:15.1601357Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1601451Z libc10_cuda.so => not found 2025-05-07T20:11:15.1601543Z libnccl.so.2 => not found 2025-05-07T20:11:15.1601634Z libcuda.so.1 => not found 2025-05-07T20:11:15.1602220Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f22b1e00000) 2025-05-07T20:11:15.1602321Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1602415Z libtorch.so => not found 2025-05-07T20:11:15.1602524Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1602620Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1602714Z libcudart.so.12 => not found 2025-05-07T20:11:15.1603002Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f22b1b9c000) 2025-05-07T20:11:15.1603148Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f22f4da1000) 2025-05-07T20:11:15.1603377Z libc.so.6 => /lib64/libc.so.6 (0x00007f22b1994000) 2025-05-07T20:11:15.1603512Z /lib64/ld-linux-x86-64.so.2 (0x00007f22f4dd7000) 2025-05-07T20:11:15.1603596Z libc10.so => not found 2025-05-07T20:11:15.1603683Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1603778Z libc10_cuda.so => not found 2025-05-07T20:11:15.1603862Z libnccl.so.2 => not found 2025-05-07T20:11:15.1603946Z libcuda.so.1 => not found 2025-05-07T20:11:15.1604375Z fbgemm_gpu_tbe_cache.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f22b0200000) 2025-05-07T20:11:15.1604811Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f22afe00000) 2025-05-07T20:11:15.1605313Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f22f4bf8000) 2025-05-07T20:11:15.1605409Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1605505Z libtorch.so => not found 2025-05-07T20:11:15.1605831Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f22af800000) 2025-05-07T20:11:15.1606249Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f22ae600000) 2025-05-07T20:11:15.1606350Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1606437Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1606524Z libcudart.so.12 => not found 2025-05-07T20:11:15.1606650Z libm.so.6 => /lib64/libm.so.6 (0x00007f22ebb25000) 2025-05-07T20:11:15.1606734Z libtorch.so => not found 2025-05-07T20:11:15.1606815Z libc10.so => not found 2025-05-07T20:11:15.1606903Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1607027Z libc10_cuda.so => not found 2025-05-07T20:11:15.1607147Z libnccl.so.2 => not found 2025-05-07T20:11:15.1607229Z libcuda.so.1 => not found 2025-05-07T20:11:15.1607331Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1607416Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1607528Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1607613Z libcudart.so.12 => not found 2025-05-07T20:11:15.1607763Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f22b193e000) 2025-05-07T20:11:15.1607843Z libc10.so => not found 2025-05-07T20:11:15.1607931Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1608026Z libc10_cuda.so => not found 
2025-05-07T20:11:15.1608109Z libnccl.so.2 => not found 2025-05-07T20:11:15.1608194Z libcuda.so.1 => not found 2025-05-07T20:11:15.1608602Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f22ebb1a000) 2025-05-07T20:11:15.1608701Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1608786Z libtorch.so => not found 2025-05-07T20:11:15.1608874Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1608970Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1609056Z libcudart.so.12 => not found 2025-05-07T20:11:15.1609138Z libc10.so => not found 2025-05-07T20:11:15.1609261Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1609340Z libc10_cuda.so => not found 2025-05-07T20:11:15.1609421Z libnccl.so.2 => not found 2025-05-07T20:11:15.1609506Z libcuda.so.1 => not found 2025-05-07T20:11:15.1609606Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1609690Z libtorch.so => not found 2025-05-07T20:11:15.1609776Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1609877Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1609957Z libcudart.so.12 => not found 2025-05-07T20:11:15.1610036Z libc10.so => not found 2025-05-07T20:11:15.1610121Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1610213Z libc10_cuda.so => not found 2025-05-07T20:11:15.1610296Z libnccl.so.2 => not found 2025-05-07T20:11:15.1610379Z libcuda.so.1 => not found 2025-05-07T20:11:15.1610717Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f22b0189000) 2025-05-07T20:11:15.1610803Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1610885Z libtorch.so => not found 2025-05-07T20:11:15.1610975Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1611070Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1611148Z libtorch.so => not found 2025-05-07T20:11:15.1611226Z libc10.so => not found 2025-05-07T20:11:15.1611321Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1611400Z libc10_cuda.so => not found 
2025-05-07T20:11:15.1611485Z libnccl.so.2 => not found 2025-05-07T20:11:15.1611569Z libcuda.so.1 => not found 2025-05-07T20:11:15.1611666Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1611752Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1611839Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1611932Z libcudart.so.12 => not found 2025-05-07T20:11:15.1612010Z libtorch.so => not found 2025-05-07T20:11:15.1612092Z libc10.so => not found 2025-05-07T20:11:15.1612175Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1612269Z libc10_cuda.so => not found 2025-05-07T20:11:15.1612352Z libnccl.so.2 => not found 2025-05-07T20:11:15.1612436Z libcuda.so.1 => not found 2025-05-07T20:11:15.1612536Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1612621Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1612707Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1612871Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f22ebb07000) 2025-05-07T20:11:15.1612964Z libtorch.so => not found 2025-05-07T20:11:15.1613044Z libc10.so => not found 2025-05-07T20:11:15.1613133Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.1613227Z libc10_cuda.so => not found 2025-05-07T20:11:15.1613309Z libnccl.so.2 => not found 2025-05-07T20:11:15.1613390Z libcuda.so.1 => not found 2025-05-07T20:11:15.1613476Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.1613571Z libtorch_cpu.so => not found 2025-05-07T20:11:15.1613704Z libtorch_cuda.so => not found 2025-05-07T20:11:15.1613828Z librt.so.1 => /lib64/librt.so.1 (0x00007f22b1939000) 2025-05-07T20:11:15.1613833Z 2025-05-07T20:11:15.1613938Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.1614200Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:11:15.1614230Z 2025-05-07T20:11:15.1627808Z 2025-05-07T20:11:15.1628482Z Dynamic section at offset 0x8d68cc8 contains 40 entries: 2025-05-07T20:11:15.1628833Z Tag Type Name/Value 2025-05-07T20:11:15.1629312Z 0x0000000000000001 (NEEDED) 
Shared library: [libc10.so] 2025-05-07T20:11:15.1629517Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:15.1629726Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.1629918Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:15.1630118Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:15.1630390Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.1630596Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:15.1630829Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.1631042Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.1631241Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.1631439Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.1631648Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.1631842Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.1632027Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.1632242Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.1632524Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:11:15.1632707Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.1632827Z 0x000000000000000c (INIT) 0xbe000 2025-05-07T20:11:15.1632955Z 0x000000000000000d (FINI) 0x5f04ec 2025-05-07T20:11:15.1633076Z 0x0000000000000019 (INIT_ARRAY) 0x8d5ea18 2025-05-07T20:11:15.1633203Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:11:15.1633332Z 0x000000000000001a (FINI_ARRAY) 0x8d5eae0 2025-05-07T20:11:15.1633451Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.1633559Z 0x000000006ffffef5 (GNU_HASH) 0x238 
2025-05-07T20:11:15.1633668Z 0x0000000000000005 (STRTAB) 0xc600 2025-05-07T20:11:15.1633786Z 0x0000000000000006 (SYMTAB) 0x2d30 2025-05-07T20:11:15.1633923Z 0x000000000000000a (STRSZ) 597451 (bytes) 2025-05-07T20:11:15.1634039Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.1634167Z 0x0000000000000003 (PLTGOT) 0x8d6afe8 2025-05-07T20:11:15.1634304Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:11:15.1634408Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.1634529Z 0x0000000000000017 (JMPREL) 0xbab38 2025-05-07T20:11:15.1634638Z 0x0000000000000007 (RELA) 0x9f1a8 2025-05-07T20:11:15.1634769Z 0x0000000000000008 (RELASZ) 113040 (bytes) 2025-05-07T20:11:15.1634887Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.1635013Z 0x000000006ffffffe (VERNEED) 0x9f088 2025-05-07T20:11:15.1635123Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:15.1635234Z 0x000000006ffffff0 (VERSYM) 0x9e3cc 2025-05-07T20:11:15.1635353Z 0x000000006ffffff9 (RELACOUNT) 3303 2025-05-07T20:11:15.1635503Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.1635520Z 2025-05-07T20:11:15.1635635Z ################################################################################ 2025-05-07T20:11:15.1635640Z 2025-05-07T20:11:15.1635705Z 2025-05-07T20:11:15.1635831Z ################################################################################ 2025-05-07T20:11:15.1636159Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.1636259Z [CHECK] Listing out library size: 2025-05-07T20:11:15.1636592Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.1636597Z 2025-05-07T20:11:15.1644189Z 59 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.1644216Z 2025-05-07T20:11:15.1645755Z [CHECK] Listing out the GLIBC versions referenced by: 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.1647390Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.1647492Z 2025-05-07T20:11:15.1809786Z GLIBC_2.2.5 2025-05-07T20:11:15.1810166Z GLIBC_2.3 2025-05-07T20:11:15.1810257Z GLIBC_2.14 2025-05-07T20:11:15.1810264Z 2025-05-07T20:11:15.1810268Z 2025-05-07T20:11:15.1810748Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.1811345Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.1811351Z 2025-05-07T20:11:15.1978058Z GLIBCXX_3.4 2025-05-07T20:11:15.1978169Z GLIBCXX_3.4.9 2025-05-07T20:11:15.1978266Z GLIBCXX_3.4.11 2025-05-07T20:11:15.1978347Z GLIBCXX_3.4.15 2025-05-07T20:11:15.1978437Z GLIBCXX_3.4.18 2025-05-07T20:11:15.1978520Z GLIBCXX_3.4.20 2025-05-07T20:11:15.1978612Z GLIBCXX_3.4.21 2025-05-07T20:11:15.1978623Z 2025-05-07T20:11:15.1978627Z 2025-05-07T20:11:15.2000902Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.J0SD2xr61g.symbols.txt 2025-05-07T20:11:15.2000975Z 2025-05-07T20:11:15.2125285Z 2025-05-07T20:11:15.2147648Z [CHECK] Total Number of symbols: 1874 2025-05-07T20:11:15.2166902Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:11:15.2182294Z + nm -gDCu ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.ZG5O54pY37.usymbols.txt 2025-05-07T20:11:15.2182322Z 2025-05-07T20:11:15.2206345Z 2025-05-07T20:11:15.2229130Z [CHECK] Listing out undefined symbols (259 total): 2025-05-07T20:11:15.2246577Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 
2025-05-07T20:11:15.2246976Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.2247199Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.2247360Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.2247512Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.2247645Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.2247793Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.2247919Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.2248037Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.2248182Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.2248295Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:11:15.2248396Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.2248503Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.2248811Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.2248926Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:11:15.2249030Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:11:15.2249187Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:11:15.2249292Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:11:15.2249392Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.2249506Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.2249603Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:11:15.2249711Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.2249809Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.2249926Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:15.2250071Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.2250358Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:11:15.2250493Z U at::RecordFunction::currentThreadId() 2025-05-07T20:11:15.2250598Z U at::RecordFunction::end() 2025-05-07T20:11:15.2250719Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:11:15.2250916Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:11:15.2251102Z U 
at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.2251259Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.2251844Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.2252475Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.2252638Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:11:15.2253114Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.2253775Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.2253912Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.2254027Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:11:15.2254169Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:11:15.2254279Z U at::globalContext() 2025-05-07T20:11:15.2254399Z U at::sequence_number::get_and_increment() 2025-05-07T20:11:15.2254488Z U bcmp@GLIBC_2.2.5 2025-05-07T20:11:15.2254577Z U c10::AnyType::get() 2025-05-07T20:11:15.2254783Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.2254882Z U c10::BoolType::get() 2025-05-07T20:11:15.2255036Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.2255224Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:11:15.2255334Z U c10::Dispatcher::realSingleton() 2025-05-07T20:11:15.2255812Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:11:15.2256525Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 
2025-05-07T20:11:15.2256874Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.2256975Z U c10::Error::what() const 2025-05-07T20:11:15.2257112Z U c10::FloatType::get() 2025-05-07T20:11:15.2257214Z U c10::GradMode::is_enabled() 2025-05-07T20:11:15.2257320Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:11:15.2257505Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.2257657Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:11:15.2257772Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:11:15.2257894Z U c10::IValue::isBoolList() const 2025-05-07T20:11:15.2257999Z U c10::IValue::isIntList() const 2025-05-07T20:11:15.2258109Z U c10::IValue::isSymIntList() const 2025-05-07T20:11:15.2258233Z U c10::IValue::isTensorList() const 2025-05-07T20:11:15.2258371Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.2258465Z U c10::IntType::get() 2025-05-07T20:11:15.2258662Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.2258779Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.2258902Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.2259022Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:11:15.2259244Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.2259507Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:11:15.2259658Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.2259771Z U c10::StringType::get() 2025-05-07T20:11:15.2259907Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:15.2260043Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.2260326Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:11:15.2260633Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:15.2260791Z U c10::SymFloat::operator/(c10::SymFloat 
const&) const 2025-05-07T20:11:15.2261215Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.2261360Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.2261490Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:11:15.2261645Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:15.2261770Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.2261902Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:15.2262044Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:15.2262176Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:15.2262287Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:15.2262403Z U c10::SymIntType::get() 2025-05-07T20:11:15.2262556Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.2262678Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:11:15.2262848Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.2262949Z U c10::TensorType::get() 2025-05-07T20:11:15.2263073Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.2263851Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.2263987Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.2264137Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.2264273Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.2264390Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.2264512Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.2264627Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.2264895Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.2265002Z U c10::cuda::device_count() 2025-05-07T20:11:15.2265141Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.2265295Z U c10::cuda::getDefaultCUDAStream(signed char) 
2025-05-07T20:11:15.2265441Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:11:15.2265584Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.2265788Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.2265903Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.2266338Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:11:15.2266961Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.2267201Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.2267674Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.2267990Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.2268533Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.2268657Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.2268762Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.2269061Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:11:15.2269247Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:11:15.2269389Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.2269549Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.2269676Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.2269788Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.2269936Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:11:15.2270292Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:11:15.2270409Z U 
c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:15.2270543Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.2270685Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:15.2270838Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:11:15.2270979Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:11:15.2272147Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:15.2272269Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.2272373Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.2272517Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.2272720Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.2272836Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.2272963Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.2273099Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.2273224Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.2273336Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.2273469Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.2273577Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.2273690Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.2273808Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.2273940Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.2274100Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.2274216Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.2274337Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.2274443Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.2274549Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.2274684Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.2274798Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.2277212Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:15.2277502Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:11:15.2277649Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.2277828Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.2277928Z U free@GLIBC_2.2.5 2025-05-07T20:11:15.2278058Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.2278206Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.2278405Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:11:15.2278542Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.2278692Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.2278812Z U memcmp@GLIBC_2.2.5 2025-05-07T20:11:15.2278909Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.2279008Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.2279117Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.2279236Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:15.2279360Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.2279706Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.2280129Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.2280230Z U realloc@GLIBC_2.2.5 2025-05-07T20:11:15.2280445Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:11:15.2280841Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.2281242Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.2281589Z U std::__cxx11::basic_stringstream, std::allocator 
>::basic_stringstream() 2025-05-07T20:11:15.2281970Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.2282347Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.2282479Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.2282708Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.2282844Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.2283029Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.2283192Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.2283318Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.2283463Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:11:15.2283686Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.2284231Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.2284367Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:11:15.2284483Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:15.2284600Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.2284724Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.2284830Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.2285004Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.2285241Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.2285361Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.2285517Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:11:15.2285656Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:11:15.2286057Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 
2025-05-07T20:11:15.2286188Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:11:15.2286311Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.2286403Z U strcmp@GLIBC_2.2.5 2025-05-07T20:11:15.2286493Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.2286611Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.2287181Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.2287612Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.2287935Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.2288055Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:11:15.2288331Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:11:15.2288543Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:11:15.2288732Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:11:15.2288908Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:11:15.2289249Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:11:15.2289389Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:11:15.2289565Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:11:15.2289748Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:11:15.2289865Z U torch::autograd::Node::assign_parent() 2025-05-07T20:11:15.2289976Z U torch::autograd::Node::metadata() 2025-05-07T20:11:15.2290146Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:11:15.2290376Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:11:15.2290628Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 
2025-05-07T20:11:15.2290774Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:11:15.2290971Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:11:15.2291173Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:11:15.2293687Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:11:15.2293841Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:11:15.2294017Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:11:15.2294228Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:11:15.2295006Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:11:15.2295169Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:11:15.2295555Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:11:15.2295922Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:15.2296543Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.2296655Z U typeinfo for c10::Error 2025-05-07T20:11:15.2296814Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.2296940Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:11:15.2297098Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:11:15.2297251Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:11:15.2297376Z U typeinfo for torch::autograd::Node 
2025-05-07T20:11:15.2298699Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.2300108Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.2301682Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.2303080Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.2304461Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.2305824Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:11:15.2306018Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:11:15.2306194Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.2306366Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:11:15.2306555Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.2306670Z U vtable for c10::Error 2025-05-07T20:11:15.2307013Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.2307187Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:11:15.2307426Z U vtable for std::basic_streambuf 
>@GLIBCXX_3.4 2025-05-07T20:11:15.2307553Z U vtable for torch::autograd::Node 2025-05-07T20:11:15.2307821Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:11:15.2307943Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.2308058Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.2308232Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:15.2308363Z w __gmon_start__ 2025-05-07T20:11:15.2308471Z w __pthread_key_create 2025-05-07T20:11:15.2308612Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.2308738Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.2308893Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.2309169Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.2309198Z 2025-05-07T20:11:15.2309317Z linux-vdso.so.1 (0x00007fff9c15d000) 2025-05-07T20:11:15.2309418Z libc10.so => not found 2025-05-07T20:11:15.2309524Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2309644Z libc10_cuda.so => not found 2025-05-07T20:11:15.2309746Z libnccl.so.2 => not found 2025-05-07T20:11:15.2309843Z libcuda.so.1 => not found 2025-05-07T20:11:15.2310441Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f1e60800000) 2025-05-07T20:11:15.2310578Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2310676Z libtorch.so => not found 2025-05-07T20:11:15.2310780Z libtorch_cpu.so => not found 2025-05-07T20:11:15.2310903Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2311001Z libcudart.so.12 => not found 2025-05-07T20:11:15.2311172Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f1e6059c000) 2025-05-07T20:11:15.2311346Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f1e9e214000) 2025-05-07T20:11:15.2311500Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f1e9a5d2000) 2025-05-07T20:11:15.2311631Z libc.so.6 => /lib64/libc.so.6 (0x00007f1e60394000) 2025-05-07T20:11:15.2311786Z 
/lib64/ld-linux-x86-64.so.2 (0x00007f1e9e272000) 2025-05-07T20:11:15.2311884Z libc10.so => not found 2025-05-07T20:11:15.2311984Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2312082Z libc10_cuda.so => not found 2025-05-07T20:11:15.2312198Z libnccl.so.2 => not found 2025-05-07T20:11:15.2312295Z libcuda.so.1 => not found 2025-05-07T20:11:15.2312761Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f1e5ec00000) 2025-05-07T20:11:15.2313252Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f1e5e800000) 2025-05-07T20:11:15.2313798Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f1e5e659000) 2025-05-07T20:11:15.2313907Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2314027Z libtorch.so => not found 2025-05-07T20:11:15.2314387Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007f1e5e000000) 2025-05-07T20:11:15.2314852Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f1e5ce00000) 2025-05-07T20:11:15.2314977Z libtorch_cpu.so => not found 2025-05-07T20:11:15.2315076Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2315171Z libcudart.so.12 => not found 2025-05-07T20:11:15.2315295Z libm.so.6 => /lib64/libm.so.6 (0x00007f1e5eb25000) 2025-05-07T20:11:15.2315403Z libtorch.so => not found 2025-05-07T20:11:15.2315494Z libc10.so => not found 2025-05-07T20:11:15.2315589Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2315692Z libc10_cuda.so => not found 2025-05-07T20:11:15.2315784Z libnccl.so.2 => not found 2025-05-07T20:11:15.2315875Z libcuda.so.1 => not found 2025-05-07T20:11:15.2315972Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2316079Z libtorch_cpu.so => not found 
2025-05-07T20:11:15.2316175Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2316433Z libcudart.so.12 => not found 2025-05-07T20:11:15.2316527Z libc10.so => not found 2025-05-07T20:11:15.2316617Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2316706Z libc10_cuda.so => not found 2025-05-07T20:11:15.2316930Z libnccl.so.2 => not found 2025-05-07T20:11:15.2317023Z libcuda.so.1 => not found 2025-05-07T20:11:15.2317430Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007f1e9a5c3000) 2025-05-07T20:11:15.2317521Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2317620Z libtorch.so => not found 2025-05-07T20:11:15.2317707Z libtorch_cpu.so => not found 2025-05-07T20:11:15.2317796Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2317879Z libcudart.so.12 => not found 2025-05-07T20:11:15.2317973Z libc10.so => not found 2025-05-07T20:11:15.2318061Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2318145Z libc10_cuda.so => not found 2025-05-07T20:11:15.2318242Z libnccl.so.2 => not found 2025-05-07T20:11:15.2318332Z libcuda.so.1 => not found 2025-05-07T20:11:15.2318424Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2318514Z libtorch.so => not found 2025-05-07T20:11:15.2318615Z libtorch_cpu.so => not found 2025-05-07T20:11:15.2318745Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2318831Z libcudart.so.12 => not found 2025-05-07T20:11:15.2318923Z libc10.so => not found 2025-05-07T20:11:15.2319010Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2319094Z libc10_cuda.so => not found 2025-05-07T20:11:15.2319178Z libnccl.so.2 => not found 2025-05-07T20:11:15.2319270Z libcuda.so.1 => not found 2025-05-07T20:11:15.2319597Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007f1e9a544000) 2025-05-07T20:11:15.2319690Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2319787Z libtorch.so => not found 2025-05-07T20:11:15.2319872Z libtorch_cpu.so => not 
found 2025-05-07T20:11:15.2319963Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2320049Z libtorch.so => not found 2025-05-07T20:11:15.2320142Z libc10.so => not found 2025-05-07T20:11:15.2320231Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2320317Z libc10_cuda.so => not found 2025-05-07T20:11:15.2320412Z libnccl.so.2 => not found 2025-05-07T20:11:15.2320501Z libcuda.so.1 => not found 2025-05-07T20:11:15.2320589Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2320678Z libtorch_cpu.so => not found 2025-05-07T20:11:15.2320778Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2320865Z libcudart.so.12 => not found 2025-05-07T20:11:15.2320951Z libtorch.so => not found 2025-05-07T20:11:15.2321043Z libc10.so => not found 2025-05-07T20:11:15.2321129Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2321213Z libc10_cuda.so => not found 2025-05-07T20:11:15.2321297Z libnccl.so.2 => not found 2025-05-07T20:11:15.2321394Z libcuda.so.1 => not found 2025-05-07T20:11:15.2321485Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2321572Z libtorch_cpu.so => not found 2025-05-07T20:11:15.2321681Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2321844Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f1e9a537000) 2025-05-07T20:11:15.2321929Z libtorch.so => not found 2025-05-07T20:11:15.2322010Z libc10.so => not found 2025-05-07T20:11:15.2322113Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.2322202Z libc10_cuda.so => not found 2025-05-07T20:11:15.2322287Z libnccl.so.2 => not found 2025-05-07T20:11:15.2322382Z libcuda.so.1 => not found 2025-05-07T20:11:15.2322467Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.2322552Z libtorch_cpu.so => not found 2025-05-07T20:11:15.2322641Z libtorch_cuda.so => not found 2025-05-07T20:11:15.2322776Z librt.so.1 => /lib64/librt.so.1 (0x00007f1e9a52e000) 2025-05-07T20:11:15.2322783Z 2025-05-07T20:11:15.2322878Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.2323150Z + readelf -d 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:11:15.2323154Z 2025-05-07T20:11:15.2339188Z 2025-05-07T20:11:15.2340499Z Dynamic section at offset 0x3a27010 contains 41 entries: 2025-05-07T20:11:15.2340890Z Tag Type Name/Value 2025-05-07T20:11:15.2341477Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:15.2342170Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:15.2342742Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.2343317Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:15.2343901Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:15.2344659Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.2345262Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:15.2345839Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.2346430Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.2347017Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.2347611Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.2348279Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.2348854Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:15.2349062Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.2349251Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.2349494Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.2349786Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:11:15.2349969Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.2350092Z 0x000000000000000c (INIT) 0x80000 
2025-05-07T20:11:15.2350226Z 0x000000000000000d (FINI) 0x261c5c 2025-05-07T20:11:15.2350347Z 0x0000000000000019 (INIT_ARRAY) 0x3a223b0 2025-05-07T20:11:15.2350480Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:11:15.2350601Z 0x000000000000001a (FINI_ARRAY) 0x3a22468 2025-05-07T20:11:15.2350734Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.2350848Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:15.2350959Z 0x0000000000000005 (STRTAB) 0xe368 2025-05-07T20:11:15.2351078Z 0x0000000000000006 (SYMTAB) 0x33a0 2025-05-07T20:11:15.2351210Z 0x000000000000000a (STRSZ) 374997 (bytes) 2025-05-07T20:11:15.2351325Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.2351447Z 0x0000000000000003 (PLTGOT) 0x3a28fe8 2025-05-07T20:11:15.2351581Z 0x0000000000000002 (PLTRELSZ) 18456 (bytes) 2025-05-07T20:11:15.2351688Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.2351801Z 0x0000000000000017 (JMPREL) 0x7b2d8 2025-05-07T20:11:15.2351930Z 0x0000000000000007 (RELA) 0x6ac28 2025-05-07T20:11:15.2352069Z 0x0000000000000008 (RELASZ) 67248 (bytes) 2025-05-07T20:11:15.2352196Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.2352324Z 0x000000006ffffffe (VERNEED) 0x6aae8 2025-05-07T20:11:15.2352437Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:15.2352558Z 0x000000006ffffff0 (VERSYM) 0x69c3e 2025-05-07T20:11:15.2352673Z 0x000000006ffffff9 (RELACOUNT) 1392 2025-05-07T20:11:15.2352790Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.2352795Z 2025-05-07T20:11:15.2352916Z ################################################################################ 2025-05-07T20:11:15.2352921Z 2025-05-07T20:11:15.2352925Z 2025-05-07T20:11:15.2353102Z ################################################################################ 2025-05-07T20:11:15.2353456Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.2353568Z [CHECK] Listing out library size: 
2025-05-07T20:11:15.2353910Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.2353915Z 2025-05-07T20:11:15.2354198Z 328 ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.2354203Z 2025-05-07T20:11:15.2354647Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.2355206Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.2355211Z 2025-05-07T20:11:15.3008560Z GLIBC_2.2.5 2025-05-07T20:11:15.3008775Z GLIBC_2.3 2025-05-07T20:11:15.3008925Z GLIBC_2.14 2025-05-07T20:11:15.3008933Z 2025-05-07T20:11:15.3008937Z 2025-05-07T20:11:15.3009430Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.3010176Z + objdump -TC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:11:15.3010181Z 2025-05-07T20:11:15.3660788Z GLIBCXX_3.4 2025-05-07T20:11:15.3661037Z GLIBCXX_3.4.9 2025-05-07T20:11:15.3662142Z GLIBCXX_3.4.11 2025-05-07T20:11:15.3662260Z GLIBCXX_3.4.18 2025-05-07T20:11:15.3662353Z GLIBCXX_3.4.20 2025-05-07T20:11:15.3662469Z GLIBCXX_3.4.21 2025-05-07T20:11:15.3662479Z 2025-05-07T20:11:15.3662486Z 2025-05-07T20:11:15.3679632Z + nm -gDC ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.QE9lRzWRnm.symbols.txt 2025-05-07T20:11:15.3679642Z 2025-05-07T20:11:15.4304799Z 2025-05-07T20:11:15.4359839Z [CHECK] Total Number of symbols: 3739 2025-05-07T20:11:15.4409104Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:11:15.4423816Z + nm -gDCu 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.TgOomuvH4r.usymbols.txt 2025-05-07T20:11:15.4425448Z 2025-05-07T20:11:15.4461104Z 2025-05-07T20:11:15.4485466Z [CHECK] Listing out undefined symbols (178 total): 2025-05-07T20:11:15.4502826Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4505333Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4506923Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:11:15.4507962Z U __cudaPopCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.4509108Z U __cudaPushCallConfiguration@libcudart.so.12 2025-05-07T20:11:15.4510187Z U __cudaRegisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.4510574Z U __cudaRegisterFatBinaryEnd@libcudart.so.12 2025-05-07T20:11:15.4510931Z U __cudaRegisterFunction@libcudart.so.12 2025-05-07T20:11:15.4511305Z U __cudaRegisterVar@libcudart.so.12 2025-05-07T20:11:15.4511656Z U __cudaUnregisterFatBinary@libcudart.so.12 2025-05-07T20:11:15.4512008Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:11:15.4512314Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:11:15.4512639Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:11:15.4512939Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:11:15.4513271Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:11:15.4513599Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:11:15.4513906Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:11:15.4514433Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:11:15.4514789Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:11:15.4515204Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:11:15.4515661Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:11:15.4516101Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:11:15.4516564Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:11:15.4517368Z U at::_ops::empty_like::call(at::Tensor const&, 
std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4518638Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4519562Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:11:15.4520139Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:11:15.4521038Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4522137Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:11:15.4522926Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:11:15.4523335Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:11:15.4523677Z U at::globalContext() 2025-05-07T20:11:15.4524084Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4524514Z U c10::BoolType::get() 2025-05-07T20:11:15.4524858Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:11:15.4525241Z U c10::FloatType::get() 2025-05-07T20:11:15.4525546Z U c10::GeneratorImpl::device() const 2025-05-07T20:11:15.4525945Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4526363Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:11:15.4526695Z U c10::IntType::get() 2025-05-07T20:11:15.4527060Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:11:15.4527436Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:11:15.4527822Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.4528216Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:11:15.4528621Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:11:15.4529038Z U c10::SymBool::guard_size_oblivious(char const*, long) const 
2025-05-07T20:11:15.4529455Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:11:15.4530100Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:11:15.4530717Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:11:15.4531116Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:11:15.4531483Z U c10::SymInt::promote_to_negative() 2025-05-07T20:11:15.4531829Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:11:15.4532202Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:11:15.4532671Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:11:15.4533037Z U c10::SymInt::toSymNode() const 2025-05-07T20:11:15.4533356Z U c10::SymIntType::get() 2025-05-07T20:11:15.4533709Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:11:15.4534154Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:11:15.4534506Z U c10::TensorType::get() 2025-05-07T20:11:15.4534842Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:11:15.4535750Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:11:15.4536652Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:11:15.4537017Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:11:15.4537358Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:11:15.4537710Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:11:15.4538035Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:11:15.4538415Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:11:15.4538852Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:11:15.4539305Z U c10::cuda::device_count() 2025-05-07T20:11:15.4539629Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:11:15.4540000Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:11:15.4540466Z U c10::cuda::getStreamFromPool(bool, signed char) 
2025-05-07T20:11:15.4541024Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:11:15.4541445Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:11:15.4541835Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:11:15.4542600Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:11:15.4543502Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:11:15.4544369Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.4545332Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:11:15.4546384Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:11:15.4547197Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:11:15.4547553Z U c10::impl::GPUTrace::haveState 2025-05-07T20:11:15.4547922Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:11:15.4548368Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:11:15.4548790Z U c10::impl::device_guard_impl_registry 2025-05-07T20:11:15.4549136Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:11:15.4549500Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:11:15.4549878Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:11:15.4550286Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:11:15.4550692Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:11:15.4551057Z U c10::throwNullDataPtrError() 2025-05-07T20:11:15.4551392Z U c10::warn(c10::Warning const&) 2025-05-07T20:11:15.4551775Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:11:15.4552200Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:11:15.4552624Z U 
caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:11:15.4553110Z U cudaDeviceGetAttribute@libcudart.so.12 2025-05-07T20:11:15.4553460Z U cudaDeviceSynchronize@libcudart.so.12 2025-05-07T20:11:15.4553799Z U cudaEventCreateWithFlags@libcudart.so.12 2025-05-07T20:11:15.4554145Z U cudaEventDestroy@libcudart.so.12 2025-05-07T20:11:15.4554467Z U cudaEventElapsedTime@libcudart.so.12 2025-05-07T20:11:15.4554800Z U cudaEventQuery@libcudart.so.12 2025-05-07T20:11:15.4555109Z U cudaEventRecord@libcudart.so.12 2025-05-07T20:11:15.4555445Z U cudaEventSynchronize@libcudart.so.12 2025-05-07T20:11:15.4555772Z U cudaFuncSetAttribute@libcudart.so.12 2025-05-07T20:11:15.4556136Z U cudaGetDeviceProperties_v2@libcudart.so.12 2025-05-07T20:11:15.4556496Z U cudaGetErrorString@libcudart.so.12 2025-05-07T20:11:15.4556815Z U cudaGetLastError@libcudart.so.12 2025-05-07T20:11:15.4557175Z U cudaLaunchKernel@libcudart.so.12 2025-05-07T20:11:15.4557484Z U cudaStreamQuery@libcudart.so.12 2025-05-07T20:11:15.4557820Z U cudaStreamSynchronize@libcudart.so.12 2025-05-07T20:11:15.4558145Z U cudaStreamWaitEvent@libcudart.so.12 2025-05-07T20:11:15.4560391Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:11:15.4562956Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:11:15.4563394Z U float at::Tensor::item() const 2025-05-07T20:11:15.4563920Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.4564381Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4564820Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.4565198Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4565637Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 
2025-05-07T20:11:15.4566056Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:11:15.4566467Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:11:15.4566823Z U memcpy@GLIBC_2.14 2025-05-07T20:11:15.4567124Z U memmove@GLIBC_2.2.5 2025-05-07T20:11:15.4567425Z U memset@GLIBC_2.2.5 2025-05-07T20:11:15.4567736Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:11:15.4568094Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:11:15.4568653Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4569409Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4570144Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4570970Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4571743Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4572493Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:11:15.4573310Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:11:15.4574161Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:11:15.4574986Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:11:15.4575817Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:11:15.4576619Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:11:15.4576964Z U std::__throw_bad_array_new_length() 2025-05-07T20:11:15.4577341Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.4577796Z U 
std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:11:15.4578231Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:11:15.4578653Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:11:15.4579148Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:11:15.4580153Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.4580979Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:11:15.4581355Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:11:15.4581727Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:11:15.4582070Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:11:15.4582496Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.4583034Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:11:15.4583526Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:11:15.4583888Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:11:15.4584198Z U strlen@GLIBC_2.2.5 2025-05-07T20:11:15.4584525Z U torch::CppFunction::~CppFunction() 2025-05-07T20:11:15.4585342Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:11:15.4586513Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.4587352Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:11:15.4588106Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:11:15.4589172Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:11:15.4593093Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, 
at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.4596924Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.4600618Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.4604290Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.4608078Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.4612063Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:11:15.4615784Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:11:15.4617790Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 
2025-05-07T20:11:15.4618215Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:11:15.4618665Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:11:15.4619281Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:11:15.4620008Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:11:15.4620548Z w _ITM_deregisterTMCloneTable 2025-05-07T20:11:15.4620885Z w _ITM_registerTMCloneTable 2025-05-07T20:11:15.4621334Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:11:15.4621674Z w __gmon_start__ 2025-05-07T20:11:15.4621965Z w __pthread_key_create 2025-05-07T20:11:15.4622315Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:11:15.4622660Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:11:15.4623062Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:11:15.4623565Z + ldd ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.4623955Z 2025-05-07T20:11:15.4624145Z linux-vdso.so.1 (0x00007ffdbdb22000) 2025-05-07T20:11:15.4624435Z libc10.so => not found 2025-05-07T20:11:15.4624695Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4624961Z libc10_cuda.so => not found 2025-05-07T20:11:15.4625234Z libnccl.so.2 => not found 2025-05-07T20:11:15.4636759Z libcuda.so.1 => not found 2025-05-07T20:11:15.4637556Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fcddcc00000) 2025-05-07T20:11:15.4638375Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4638647Z libtorch.so => not found 2025-05-07T20:11:15.4638883Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4639152Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4639409Z libcudart.so.12 => not found 2025-05-07T20:11:15.4639710Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fcddc99c000) 2025-05-07T20:11:15.4640116Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fce2b9f6000) 
2025-05-07T20:11:15.4640491Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fce2b9c8000) 2025-05-07T20:11:15.4640851Z libc.so.6 => /lib64/libc.so.6 (0x00007fcddc794000) 2025-05-07T20:11:15.4641193Z /lib64/ld-linux-x86-64.so.2 (0x00007fce2ba54000) 2025-05-07T20:11:15.4641500Z libc10.so => not found 2025-05-07T20:11:15.4641728Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4641977Z libc10_cuda.so => not found 2025-05-07T20:11:15.4642231Z libnccl.so.2 => not found 2025-05-07T20:11:15.4642463Z libcuda.so.1 => not found 2025-05-07T20:11:15.4643052Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007fcddb000000) 2025-05-07T20:11:15.4644007Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fcddac00000) 2025-05-07T20:11:15.4645042Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fcddaa59000) 2025-05-07T20:11:15.4645746Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4645995Z libtorch.so => not found 2025-05-07T20:11:15.4646485Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so (0x00007fcdda400000) 2025-05-07T20:11:15.4647335Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fcdd9200000) 2025-05-07T20:11:15.4647948Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4648189Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4648446Z libcudart.so.12 => not found 2025-05-07T20:11:15.4648728Z libm.so.6 => /lib64/libm.so.6 (0x00007fce16925000) 2025-05-07T20:11:15.4649021Z libtorch.so => not found 2025-05-07T20:11:15.4649257Z libc10.so => not found 2025-05-07T20:11:15.4649476Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4649725Z libc10_cuda.so => not found 2025-05-07T20:11:15.4649961Z 
libnccl.so.2 => not found 2025-05-07T20:11:15.4650195Z libcuda.so.1 => not found 2025-05-07T20:11:15.4650430Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4650680Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4650992Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4651252Z libcudart.so.12 => not found 2025-05-07T20:11:15.4651491Z libc10.so => not found 2025-05-07T20:11:15.4651720Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4651966Z libc10_cuda.so => not found 2025-05-07T20:11:15.4652233Z libnccl.so.2 => not found 2025-05-07T20:11:15.4652470Z libcuda.so.1 => not found 2025-05-07T20:11:15.4653176Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so (0x00007fce2b9b1000) 2025-05-07T20:11:15.4653785Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4654034Z libtorch.so => not found 2025-05-07T20:11:15.4654280Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4654520Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4654772Z libcudart.so.12 => not found 2025-05-07T20:11:15.4655018Z libc10.so => not found 2025-05-07T20:11:15.4655234Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4655483Z libc10_cuda.so => not found 2025-05-07T20:11:15.4655721Z libnccl.so.2 => not found 2025-05-07T20:11:15.4655962Z libcuda.so.1 => not found 2025-05-07T20:11:15.4656196Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4656447Z libtorch.so => not found 2025-05-07T20:11:15.4656679Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4656957Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4657197Z libcudart.so.12 => not found 2025-05-07T20:11:15.4657446Z libc10.so => not found 2025-05-07T20:11:15.4657670Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4657911Z libc10_cuda.so => not found 2025-05-07T20:11:15.4658154Z libnccl.so.2 => not found 2025-05-07T20:11:15.4658384Z libcuda.so.1 => not found 2025-05-07T20:11:15.4658871Z asmjit.so => 
/__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so (0x00007fce2b932000) 2025-05-07T20:11:15.4659390Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4659645Z libtorch.so => not found 2025-05-07T20:11:15.4659871Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4660225Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4660467Z libtorch.so => not found 2025-05-07T20:11:15.4660881Z libc10.so => not found 2025-05-07T20:11:15.4661173Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4661425Z libc10_cuda.so => not found 2025-05-07T20:11:15.4661706Z libnccl.so.2 => not found 2025-05-07T20:11:15.4661953Z libcuda.so.1 => not found 2025-05-07T20:11:15.4662218Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4662478Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4662746Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4663007Z libcudart.so.12 => not found 2025-05-07T20:11:15.4663272Z libtorch.so => not found 2025-05-07T20:11:15.4663514Z libc10.so => not found 2025-05-07T20:11:15.4663755Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4664014Z libc10_cuda.so => not found 2025-05-07T20:11:15.4664265Z libnccl.so.2 => not found 2025-05-07T20:11:15.4664516Z libcuda.so.1 => not found 2025-05-07T20:11:15.4664765Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4665043Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4665305Z libtorch_cuda.so => not found 2025-05-07T20:11:15.4665655Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fce2b925000) 2025-05-07T20:11:15.4666033Z libtorch.so => not found 2025-05-07T20:11:15.4666285Z libc10.so => not found 2025-05-07T20:11:15.4666517Z libnvrtc.so.12 => not found 2025-05-07T20:11:15.4666782Z libc10_cuda.so => not found 2025-05-07T20:11:15.4667044Z libnccl.so.2 => not found 2025-05-07T20:11:15.4667292Z libcuda.so.1 => not found 2025-05-07T20:11:15.4667552Z libnvidia-ml.so.1 => not found 2025-05-07T20:11:15.4667819Z libtorch_cpu.so => not found 2025-05-07T20:11:15.4668094Z libtorch_cuda.so => not 
found 2025-05-07T20:11:15.4668398Z librt.so.1 => /lib64/librt.so.1 (0x00007fce2b91c000) 2025-05-07T20:11:15.4668651Z 2025-05-07T20:11:15.4668763Z [CHECK] Displaying ELF information: 2025-05-07T20:11:15.4669241Z + readelf -d ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:11:15.4669637Z 2025-05-07T20:11:15.4669718Z 2025-05-07T20:11:15.4669885Z Dynamic section at offset 0x147859a8 contains 41 entries: 2025-05-07T20:11:15.4670278Z Tag Type Name/Value 2025-05-07T20:11:15.4670689Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:11:15.4671235Z 0x0000000000000001 (NEEDED) Shared library: [libnvrtc.so.12] 2025-05-07T20:11:15.4671743Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:11:15.4672257Z 0x0000000000000001 (NEEDED) Shared library: [libnccl.so.2] 2025-05-07T20:11:15.4672867Z 0x0000000000000001 (NEEDED) Shared library: [libcuda.so.1] 2025-05-07T20:11:15.4673389Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:11:15.4673934Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:11:15.4674406Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:11:15.4674884Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:11:15.4675357Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:11:15.4675873Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.12] 2025-05-07T20:11:15.4676805Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:11:15.4677311Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:11:15.4677823Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:11:15.4678313Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:11:15.4678833Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:11:15.4679430Z 0x000000000000000e 
(SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:11:15.4680010Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:11:15.4680423Z 0x000000000000000c (INIT) 0x1dc000 2025-05-07T20:11:15.4680750Z 0x000000000000000d (FINI) 0xe754cc 2025-05-07T20:11:15.4681105Z 0x0000000000000019 (INIT_ARRAY) 0x1476a588 2025-05-07T20:11:15.4681459Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:11:15.4681824Z 0x000000000000001a (FINI_ARRAY) 0x1476a830 2025-05-07T20:11:15.4682162Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:11:15.4682511Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:11:15.4682844Z 0x0000000000000005 (STRTAB) 0x1c8a0 2025-05-07T20:11:15.4683165Z 0x0000000000000006 (SYMTAB) 0x6a00 2025-05-07T20:11:15.4683525Z 0x000000000000000a (STRSZ) 1486798 (bytes) 2025-05-07T20:11:15.4683881Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:11:15.4684234Z 0x0000000000000003 (PLTGOT) 0x1478afe8 2025-05-07T20:11:15.4684588Z 0x0000000000000002 (PLTRELSZ) 22152 (bytes) 2025-05-07T20:11:15.4684944Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:11:15.4685261Z 0x0000000000000017 (JMPREL) 0x1d5988 2025-05-07T20:11:15.4685606Z 0x0000000000000007 (RELA) 0x1896c8 2025-05-07T20:11:15.4685968Z 0x0000000000000008 (RELASZ) 312000 (bytes) 2025-05-07T20:11:15.4686318Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:11:15.4686664Z 0x000000006ffffffe (VERNEED) 0x1895a8 2025-05-07T20:11:15.4686991Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:11:15.4687321Z 0x000000006ffffff0 (VERSYM) 0x18786e 2025-05-07T20:11:15.4687649Z 0x000000006ffffff9 (RELACOUNT) 8035 2025-05-07T20:11:15.4687970Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:11:15.4688170Z 2025-05-07T20:11:15.4688282Z ################################################################################ 2025-05-07T20:11:15.4688634Z 2025-05-07T20:11:15.4688708Z 2025-05-07T20:11:15.4689054Z [CHECK] Verifying sample subset of symbols in the built libraries ... 
2025-05-07T20:11:15.4730401Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.4751385Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.4983818Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.5021347Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.5079323Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.5118540Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.5147749Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.5178807Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:11:15.5286149Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.5307868Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.5543760Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.5578780Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.5627680Z [CHECK] Symbol NOT found in 
./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.5661299Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.5703046Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.5732530Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.6147652Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.6518873Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.6741867Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.7692345Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.7727498Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.7820949Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.8157256Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.9/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:11:15.8162611Z ################################################################################ 2025-05-07T20:11:15.8164113Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:15.8165261Z 2025-05-07T20:11:15.8167131Z + conda 
run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:15.8168799Z 2025-05-07T20:11:27.8703256Z 2025-05-07T20:11:27.8704144Z fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl is 2025-05-07T20:11:27.8706008Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:11:27.8706855Z 2025-05-07T20:11:27.8707330Z The wheel references external versioned symbols in these 2025-05-07T20:11:27.8708631Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:11:27.8709858Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_3.0'}, 2025-05-07T20:11:27.8710510Z libstdc++.so.6 with versions {'GLIBCXX_3.4.11', 'CXXABI_1.3.7', 2025-05-07T20:11:27.8710950Z 'GLIBCXX_3.4.20', 'GLIBCXX_3.4.19', 'CXXABI_1.3', 'CXXABI_1.3.5', 2025-05-07T20:11:27.8711371Z 'GLIBCXX_3.4.9', 'CXXABI_1.3.11', 'GLIBCXX_3.4.21', 'GLIBCXX_3.4.18', 2025-05-07T20:11:27.8711816Z 'GLIBCXX_3.4.15', 'GLIBCXX_3.4.14', 'CXXABI_1.3.3', 'GLIBCXX_3.4'}, 2025-05-07T20:11:27.8712257Z libc.so.6 with versions {'GLIBC_2.3.2', 'GLIBC_2.3.3', 'GLIBC_2.7', 2025-05-07T20:11:27.8712694Z 'GLIBC_2.3', 'GLIBC_2.14', 'GLIBC_2.2.5', 'GLIBC_2.17', 'GLIBC_2.6'}, 2025-05-07T20:11:27.8713114Z libpthread.so.0 with versions {'GLIBC_2.2.5', 'GLIBC_2.3.4', 2025-05-07T20:11:27.8713615Z 'GLIBC_2.3.2'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:11:27.8714036Z libcudart.so.12 with versions {'libcudart.so.12'}, libgomp.so.1 with 2025-05-07T20:11:27.8714527Z versions {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.2.5', 2025-05-07T20:11:27.8714886Z 'GLIBC_2.3.4'} 2025-05-07T20:11:27.8715007Z 2025-05-07T20:11:27.8715206Z This constrains the platform tag to "manylinux_2_27_x86_64". 
In order 2025-05-07T20:11:27.8715691Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:11:27.8716132Z wheel from source on a system with earlier versions of these 2025-05-07T20:11:27.8716526Z libraries, such as a recent manylinux image. 2025-05-07T20:11:27.9438566Z 2025-05-07T20:11:27.9438725Z 2025-05-07T20:11:27.9439409Z ################################################################################ 2025-05-07T20:11:27.9440242Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:11:27.9440886Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:27.9441239Z 2025-05-07T20:11:27.9457820Z -rw-r--r--. 1 root root 505M May 7 20:11 dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:27.9458311Z 2025-05-07T20:11:27.9458433Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:11:27.9458910Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:27.9459264Z 2025-05-07T20:11:28.8550245Z 3f2d5b5d5a748724cfa39eef02dfa26aa9e99db5 dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:28.8550794Z 2025-05-07T20:11:28.8551053Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:28.8551455Z 2025-05-07T20:11:31.0185352Z 03f7454c442f6e488a67c06774ba52155da77e9d7fe324eb4aa6e479935ce1eb dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:31.0187262Z 2025-05-07T20:11:31.0187971Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:31.0189017Z 2025-05-07T20:11:31.8253349Z 100a9ed18838c692811fb514d43b0207 dist/fbgemm_gpu_nightly-2025.5.7-cp39-cp39-manylinux_2_28_x86_64.whl 2025-05-07T20:11:31.8253865Z 2025-05-07T20:11:31.8254006Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:11:31.8365031Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:11:31.8365359Z with: 
2025-05-07T20:11:31.8365631Z name: fbgemm_default_x86_clang_py3.9_cu12.6.3.whl 2025-05-07T20:11:31.8365972Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:11:31.8366275Z if-no-files-found: error 2025-05-07T20:11:31.8366535Z compression-level: 6 2025-05-07T20:11:31.8366795Z overwrite: false 2025-05-07T20:11:31.8367055Z include-hidden-files: false 2025-05-07T20:11:31.8367310Z env: 2025-05-07T20:11:31.8367555Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:11:31.8367858Z BUILD_ENV: build_binary 2025-05-07T20:11:31.8368127Z BUILD_TARGET: default 2025-05-07T20:11:31.8368477Z BUILD_VARIANT: cuda 2025-05-07T20:11:31.8368733Z BUILD_CUDA_VERSION: 12.6.3 2025-05-07T20:11:31.8368983Z ##[endgroup] 2025-05-07T20:11:31.8372450Z ##[command]/usr/bin/docker exec bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:32.2849468Z With the provided path, there will be 1 file uploaded 2025-05-07T20:11:32.2850043Z Artifact name is valid! 2025-05-07T20:11:32.2851049Z Root directory input is valid! 
2025-05-07T20:11:32.3671195Z Beginning upload of artifact content to blob storage 2025-05-07T20:11:33.1463728Z Uploaded bytes 8388608 2025-05-07T20:11:33.2816506Z Uploaded bytes 16777216 2025-05-07T20:11:33.5707849Z Uploaded bytes 25165824 2025-05-07T20:11:33.8583914Z Uploaded bytes 33554432 2025-05-07T20:11:34.2201833Z Uploaded bytes 41943040 2025-05-07T20:11:34.5096361Z Uploaded bytes 50331648 2025-05-07T20:11:34.8120769Z Uploaded bytes 58720256 2025-05-07T20:11:35.1223205Z Uploaded bytes 67108864 2025-05-07T20:11:35.4302668Z Uploaded bytes 75497472 2025-05-07T20:11:35.7847456Z Uploaded bytes 83886080 2025-05-07T20:11:36.0724520Z Uploaded bytes 92274688 2025-05-07T20:11:36.3093151Z Uploaded bytes 100663296 2025-05-07T20:11:36.6272920Z Uploaded bytes 109051904 2025-05-07T20:11:36.9743979Z Uploaded bytes 117440512 2025-05-07T20:11:37.3382603Z Uploaded bytes 125829120 2025-05-07T20:11:37.6489031Z Uploaded bytes 134217728 2025-05-07T20:11:37.9453491Z Uploaded bytes 142606336 2025-05-07T20:11:38.2634009Z Uploaded bytes 150994944 2025-05-07T20:11:38.6136884Z Uploaded bytes 159383552 2025-05-07T20:11:38.9313597Z Uploaded bytes 167772160 2025-05-07T20:11:39.2308622Z Uploaded bytes 176160768 2025-05-07T20:11:39.5383592Z Uploaded bytes 184549376 2025-05-07T20:11:39.8378550Z Uploaded bytes 192937984 2025-05-07T20:11:40.1883230Z Uploaded bytes 201326592 2025-05-07T20:11:40.4440147Z Uploaded bytes 209715200 2025-05-07T20:11:40.8236586Z Uploaded bytes 218103808 2025-05-07T20:11:41.0659357Z Uploaded bytes 226492416 2025-05-07T20:11:41.4323576Z Uploaded bytes 234881024 2025-05-07T20:11:41.6671175Z Uploaded bytes 243269632 2025-05-07T20:11:41.9470380Z Uploaded bytes 251658240 2025-05-07T20:11:42.2746147Z Uploaded bytes 260046848 2025-05-07T20:11:42.5376913Z Uploaded bytes 268435456 2025-05-07T20:11:42.8007917Z Uploaded bytes 276824064 2025-05-07T20:11:43.1451514Z Uploaded bytes 285212672 2025-05-07T20:11:43.5992852Z Uploaded bytes 293601280 2025-05-07T20:11:43.8852211Z Uploaded 
bytes 301989888 2025-05-07T20:11:44.0991646Z Uploaded bytes 310378496 2025-05-07T20:11:44.4103198Z Uploaded bytes 318767104 2025-05-07T20:11:44.7561791Z Uploaded bytes 327155712 2025-05-07T20:11:45.0136572Z Uploaded bytes 335544320 2025-05-07T20:11:45.3680214Z Uploaded bytes 343932928 2025-05-07T20:11:45.6823360Z Uploaded bytes 352321536 2025-05-07T20:11:46.0230596Z Uploaded bytes 360710144 2025-05-07T20:11:46.3287886Z Uploaded bytes 369098752 2025-05-07T20:11:46.6687981Z Uploaded bytes 377487360 2025-05-07T20:11:46.9577269Z Uploaded bytes 385875968 2025-05-07T20:11:47.2586366Z Uploaded bytes 394264576 2025-05-07T20:11:47.5971453Z Uploaded bytes 402653184 2025-05-07T20:11:47.9026260Z Uploaded bytes 411041792 2025-05-07T20:11:48.2153514Z Uploaded bytes 419430400 2025-05-07T20:11:48.5777357Z Uploaded bytes 427819008 2025-05-07T20:11:48.8187545Z Uploaded bytes 436207616 2025-05-07T20:11:49.1187699Z Uploaded bytes 444596224 2025-05-07T20:11:49.4401610Z Uploaded bytes 452984832 2025-05-07T20:11:49.7404163Z Uploaded bytes 461373440 2025-05-07T20:11:50.0364557Z Uploaded bytes 469762048 2025-05-07T20:11:50.3705977Z Uploaded bytes 478150656 2025-05-07T20:11:50.6290439Z Uploaded bytes 486539264 2025-05-07T20:11:50.9170290Z Uploaded bytes 494927872 2025-05-07T20:11:51.2351716Z Uploaded bytes 503316480 2025-05-07T20:11:51.5422087Z Uploaded bytes 511705088 2025-05-07T20:11:51.7221559Z Uploaded bytes 518288897 2025-05-07T20:11:51.7402380Z Finished uploading artifact content to blob storage! 2025-05-07T20:11:51.7404340Z SHA256 digest of uploaded artifact zip is b6e8d4d607a80ebc7900f304e4e7bf96f593e18fdd75d6cfb7392dc380b80222 2025-05-07T20:11:51.7406443Z Finalizing artifact upload 2025-05-07T20:11:51.8285382Z Artifact fbgemm_default_x86_clang_py3.9_cu12.6.3.whl.zip successfully finalized. Artifact ID 3081459676 2025-05-07T20:11:51.8288173Z Artifact fbgemm_default_x86_clang_py3.9_cu12.6.3.whl has been successfully uploaded! Final size is 518288897 bytes. 
Artifact ID is 3081459676 2025-05-07T20:11:51.8297703Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081459676 2025-05-07T20:11:51.8507099Z Post job cleanup. 2025-05-07T20:11:51.8511842Z ##[command]/usr/bin/docker exec bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:11:52.1571708Z [command]/usr/bin/git version 2025-05-07T20:11:52.1607145Z git version 2.47.1 2025-05-07T20:11:52.1639860Z Copying '/github/home/.gitconfig' to '/__w/_temp/e5a6738d-f449-482c-a370-96194345d78c/.gitconfig' 2025-05-07T20:11:52.1645132Z Temporarily overriding HOME='/__w/_temp/e5a6738d-f449-482c-a370-96194345d78c' before making global git config changes 2025-05-07T20:11:52.1646209Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:11:52.1650094Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:11:52.1682634Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:11:52.1708307Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:11:52.1978337Z Entering 'external/asmjit' 2025-05-07T20:11:52.2023860Z Entering 'external/composable_kernel' 2025-05-07T20:11:52.2082761Z Entering 'external/cpuinfo' 2025-05-07T20:11:52.2147484Z Entering 'external/cutlass' 2025-05-07T20:11:52.2224898Z Entering 'external/googletest' 2025-05-07T20:11:52.2294509Z Entering 'external/hipify_torch' 2025-05-07T20:11:52.2360891Z Entering 'external/json' 2025-05-07T20:11:52.2422635Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:11:52.2440520Z http.https://github.com/.extraheader 2025-05-07T20:11:52.2444143Z [command]/usr/bin/git config --local --unset-all 
http.https://github.com/.extraheader 2025-05-07T20:11:52.2474281Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:11:52.2736652Z Entering 'external/asmjit' 2025-05-07T20:11:52.2769137Z http.https://github.com/.extraheader 2025-05-07T20:11:52.2809694Z Entering 'external/composable_kernel' 2025-05-07T20:11:52.2843004Z http.https://github.com/.extraheader 2025-05-07T20:11:52.2885828Z Entering 'external/cpuinfo' 2025-05-07T20:11:52.2933737Z http.https://github.com/.extraheader 2025-05-07T20:11:52.2972887Z Entering 'external/cutlass' 2025-05-07T20:11:52.3014549Z http.https://github.com/.extraheader 2025-05-07T20:11:52.3059861Z Entering 'external/googletest' 2025-05-07T20:11:52.3091279Z http.https://github.com/.extraheader 2025-05-07T20:11:52.3121521Z Entering 'external/hipify_torch' 2025-05-07T20:11:52.3154306Z http.https://github.com/.extraheader 2025-05-07T20:11:52.3200941Z Entering 'external/json' 2025-05-07T20:11:52.3242454Z http.https://github.com/.extraheader 2025-05-07T20:11:52.3431579Z Stop and remove container: 8517788a26554f83a076d858efab411e_amazonlinux2023_2de706 2025-05-07T20:11:52.3436734Z ##[command]/usr/bin/docker rm --force bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 2025-05-07T20:11:53.0045142Z bd0f6f4466627651321f9536b804a3259c307db0438d1e394995066ae59c3be1 2025-05-07T20:11:53.0071414Z Remove container network: github_network_80f6ca3f88ec44e288b33a2daa062f9f 2025-05-07T20:11:53.0076366Z ##[command]/usr/bin/docker network rm github_network_80f6ca3f88ec44e288b33a2daa062f9f 2025-05-07T20:11:54.0324176Z github_network_80f6ca3f88ec44e288b33a2daa062f9f 2025-05-07T20:11:54.0366878Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:11:54.0386335Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 
2025-05-07T20:11:54.0392162Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:11:54.0392574Z ##[endgroup] 2025-05-07T20:11:54.0504986Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:12:04.2280277Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:12:20.4268390Z Cleaning up orphan processes